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ABSTRACT 


We  consider  a  situation  where  two  processors  P\  and  P j  are  to  evaluate  a  collection  of 
functions  m.,  ■  o{  two  vector  variables  x,  y,  under  the  assumption  that  processor  P^ 
(respectively,  Pt)  has  access/only  to  the  value  of  the  variable  z  (respectively,  y)  and  the 
functional  form  of  We  provide  some  new  bounds  on  the  communication  com¬ 

plexity  (the  amount  of  information  that  has  to  be  exchanged  between  the  processors)  for 
this  problem.  An  almost  optimal  bound  is  derived  for  the  case  of  one-way  communication 
when  the  functions  fo,.. A  are  polynomials.  We  also  derive  seme  new  lower  bounds  for 
the  case  of  two-way  communication  which  improve  on  earlier  bounds  by  Abelson  [A  80\ 
As  an  application,  we  consider  the  case  where  x  and  y  are  n  x  n  matrices  and  /(x,y)  is  a 
particular  entry  of  the  inverse  of  x  +  y.  Under  a  certain  restriction  on  the  class  of  allowed 
communication  protocols,  we  obtain  an  fl(n?)  lower  bound,  in  contrast  to  the  f^(n)  lower 
bound  obtained  by  applying  Abelson ’s  results.  Our  results  are  based  on  certaui  tools  from 
classical  algebraic  geometry  and  field  extension  theory.  ^','c  * - — 
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1  Introduction. 


In  several  situations  of  practical  interest  there  is  a  set  of  processors  who  wish  to  perform 
some  computational  task  and  who  must  communicate  because  none  of  them  possesses  all 
of  the  problem  data.  Communication  resources  are  often  limited  and  w»  are  led  to  the 
study  of  the  minimal  amount  of  required  information  transfer,  that  is,  the  “communica¬ 
tion  complexity”  of  the  problem  under  consideration.  For  example,  in  parallel  computation 
[BT  89],  communication  is  often  much  slower  than  computation  and  excessive  communica¬ 
tion  may  create  bottlenecks  to  the  speed  of  an  algorithm.  A  similar  argument  applies  to 
computations  using  special  purpose  VLSI  chips  [U  84]  in  which  communications  capabili¬ 
ties  are  constrained  by  physical  and  topological  considerations.  Finally,  there  are  several 
applications  in  signal  processing:  for  example,  in  decentralized  estimation  and  detection, 
or  in  distributed  sensor  networks  [TS  81],  data  are  collected  at  geographically  distant  sites. 
Then,  summaries  of  the  data  are  communicated  so  as  to  enable  a  particular  processor  or  sen¬ 
sor  to  make  certain  statistical  inferences  (see  e.g.  [WB  82]).  Communication  resources  are 
often  costly  in  such  contexts,  and  it  is  again  natural  to  minimize  the  amount  of  information 
exchange. 

1.1  Communication  Protocols. 

In  this  subsection,  we  introduce  the  class  of  protocols  that  will  be  considered  and  we  for¬ 
mulate  the  general  problem  to  be  studied. 

Let  there  be  two  processors  Pj  and  Pj.  Processor  P\  (respectively,  P{)  has  access  to 
the  value  of  a  vector  z  £  5Rm  (respectively  y  €  9?n).  Let  there  be  given  a  finite  collection  / 
of  functions  Dj  »-»  5R,  where  Dj  is  some  subset  of  5Rm  x  5tn  on  which  these 

functions  are  defined.  (For  example,  if  each  fi  is  a  rational  function  expressed  as  a  ratio  of 
two  relatively  prime  polynomials,  it  is  natural  to  let  Dj  be  the  set  of  all  vectors  at  which 
none  of  the  denominators  of  these  functions  vanishes.) 

The  objective  of  the  processors  is  to  exchange  messages  and  compute  the  values  /i(x,y), 
. . . ,  f,{x,  y).  It  is  assumed  that  both  processors  know  the  formulas  defining  these  functions. 
(For  instance,  if  each  /,  is  a  polynomial,  then  each  processor  knows  the  coefficients  of  these 
polynomials.)  Ideally,  a  protocol  should  work  for  all  possible  values  (x,  y)  £  Dj  of  the 
“inputs”.  We  will  occasionally  consider,  however,  protocols  which  are  defined  only  when 
(x,y)  belongs  to  some  possibly  smaller  set  D  C  Dj. 

In  a  two-way  communication  protocol  ir,  messages  can  be  exchanged  in  both  directions. 
We  use  r(n)  to  denote  the  number  of  exchanged  messages  and  we  let  7j_2  (respectively, 
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T2_i)  denote  the  set  of  i’s  for  which  the  tth  message  is  transmitted  from  Pi  to  Pi  (re¬ 
spectively,  from  P2  to  Pi).  The  protocol  is  defined  in  terms  of  a  collection  of  functions 
mapping  a  set  D  C  Dj  into  5R.  (In  particular,  m,(x,y)  is  the  value  of  the 
ith  message  and  the  set  D  is  called  the  domain  of  the  protocol.)  Since  a  message  by  a 
processor  can  only  be  a  function  of  the  information  available  to  that  processor,  we  impose 
the  requirement  that  for  each  t,  there  exists  some  real-valued  function  m,  such  that 

m,(x,y)  =  m,  (x,mi(x,y),...,m<-1(x,y)),  V(x,y)€Z?,  if  i  €  7i_2,  (1.1) 

and 

mi(z,y)  =  rh,  (y,mi(x,y),...,mt_i(x,y)),  V(x, y)  €  D  if  i  G  T2-i.  (1.2) 

We  say  that  the  protocol  is  legitimate  if  either  of  the  following  conditions  is  true: 

a)  There  exist  functions  hi, . . . ,  hB  such  that 

/,(x> y)  =  h ,  (x,m1(x,y),...,mr(,)(x,y))  ,  V(x,y)  e  D,  i  =  l,...,s.  (1.3) 

(This  corresponds  to  the  case  where  processor  Pi  evaluates  the  final  result.) 

b)  There  exist  functions  hi, . . . ,  h,  such  that 

/i(x> y)  =  hi  (y>»"i(*,v),...,  »*,■(*)(*»  v)) ,  v(x,y)  e  d,  i  =  i,...,«.  (1.4) 

Let  II (/;  D,  *-*)  denote  the  clfiss  of  all  legitimate  two-way  protocols,  with  domain  D,  for 
computing  the  functions  /i, . . . ,  /»,  subject  to  some  additional  restrictions  to  be  introduced 
later.  We  define  the  two-way  communication  complexity  C(/;  £>,<-*)  for  computing  /  on 
the  domain  D  to  be 

C{1\D,~)=  inf  r(x). 

iren(/;D,~) 

The  definition  of  an  one-way  communication  protocol  x  is  identical,  except  that  messages 
can  only  be  transmitted  by  processor  Pi.  That  is,  the  set  T2_i  is  assumed  empty.  Let 
II(/;  Z?,  — 1 ►)  denote  the  set  of  all  legitimate  one-way  communication  protocols  with  domain 
D.  We  define  the  one-way  (from  Pi  to  P2)  communication  complexity  C(f  \D,—*)  on  the 
domain  D  to  be 

C(/;  £>,-»)=  inf  r(x). 

*en(/;D,-) 

Notice  that  in  the  above  models  the  protocols  are  “continuous”  in  the  sense  that  the 
messages  to  be  sent  are  real  numbers.  Given  that  real  numbers  can  only  be  encoded  with 
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an  infinite  number  of  bits,  such  protocols  might  seem  impossible  to  implement  in  practice. 
However,  parallel  and  distributed  numerical  algorithms  are  almost  always  described  and 
analyzed  as  if  real  numbers  can  be  communicated,  with  the  understanding  that  in  practice 
these  numbers  will  be  encoded  with  a  finite  number  of  bits  which  is  sufficient  to  obtain  a 
desired  accuracy.  Furthermore,  if  the  messages  being  transmitted  are  rational  functions  of 
the  data  and  if  the  data  consist  of  rational  numbers,  then  an  implementation  using  a  finite 
number  of  bits  is  clearly  possible.  Finally,  in  practice,  it  is  usually  the  case  that  a  field  of  a 
fixed  length  is  used  for  transmitting  an  encoded  version  of  a  real  number.  For  this  reason, 
it  is  reasonable  to  count  the  number  of  real-valued  messages  being  transmitted,  as  opposed 
to  counting  individual  bits.  Our  model  is  therefore  a  fairly  realistic  way  of  capturing  the 
communication  resources  needed  in  a  number  of  practical  applications. 

Typically,  some  smoothness  constraints  have  to  be  imposed  on  the  message  functions 
mi, ... ,  This  is  because  there  exist  one-to  one  functions  from  5Rm  into  ?R,  and  pro¬ 

cessor  Pi  could  transmit  the  value  of  its  vector  x  by  using  a  single  message.  In  particular, 
Pi  can  simply  interleave  the  binary  expansions  of  the  components  of  x  and  use  the  re¬ 
sulting  number  as  a  message.  This  is  not  a  useful  protocol,  for  the  purposes  of  numerical 
computation,  and  is  unlike  any  protocol  that  is  used  in  practice.  In  contrast  to  the  above 
described  interleaving,  a  good  protocol  should  compress  the  information  in  x  or  y  intelli¬ 
gently,  and  then  transmit  only  the  compressed  information.  For  this  reason,  we  shall  impose 
some  smoothness  requirements  on  the  message  functions  m,  .  From  a  technical  point  of  view, 
smoothness  assumptions  prohibit  the  use  of  one-to-one  functions  from  5Rm  into  if  m  >  1. 
From  a  practical  point  of  view,  such  smoothness  is  present  in  the  vast  majority  of  practical 
numerical  methods  for  algebraic  problems.  Furthermore,  in  this  paper,  we  concentrate  on 
the  case  where  each  one  of  the  functions  in  /i,...,/«  is  rational.  It  is  then  natural  to 
restrict  attention  even  further  to  protocols  involving  only  rational  functions  of  the  data. 
This  is  equivalent  to  an  assumption  that  each  processor  can  only  perform  the  elementary 
arithmetic  operations.  Such  an  assumption  is  common  in  complexity  studies  for  algebraic 
problems. 

In  the  sequel  we  use  the  shorter  notations  U(J;D)  and  C[]\D)  whenever  it  is  clear 
from  the  context  whether  we  are  dealing  with  one-way  or  two-way  protocols.  Furthermore, 
we  use  the  notation  n(/;  D)  and  C(/;  D)  whenever  a  =  1  and  the  collection  /  of  functions 
consists  of  the  single  function  /. 

In  this  paper,  we  will  consider  various  restrictions  on  the  set  of  allowed  protocols.  We 
indicate  these  restrictions  in  our  notation  as  shown  in  Table  1: 
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Notations 


Restrictions  on  the  message 
functions 

(cf.  Eqs.  (1.1)— (1-2)). 


Restrictions  on  the  final 
evaluation  functions 
hly...,ht{d.  Eqs.  ( 1 .3)-(  1 .4) ) . 


I Mf,D) 
n2(J,D) 
n°c{f,D) 
n  ratCf^D) 
flpo/y  (/>  D) 
niinear  (/)  D) 


continuously  differentiable 

twice  continuously  differentiable 

infinitely  differentiable 

rational 

polynomial 

linear 


continuously  differentiable 

twice  continuously  differentiable 

infinitely  differentiable 

rational 

rational 

polynomial 


Table  1 


We  use  notation  like  Cj(/;  D),  D),  etc.,  to  denote  the  communication  complexity 

under  the  restrictions  on  the  protocols  introduced  in  Table  1.  Notice  that  as  we  go  down 
the  table  additional  restrictions  are  introduced  and,  therefore,  the  corresponding  commu¬ 
nication  complexity  can  only  increase.  Finally,  assuming  that  D  is  a  nonempty  open  set, 
we  see  that  the  set  Ylrat{f  \D)  (respectively,  rii,neaf  (/;  D))  is  empty  unless  /  is  a  rational 
(respectively,  polynomial)  function. 

All  of  our  definitions  can  be  extended,  in  the  obvious  way,  to  the  case  where  the  real 
number  field  SR  is  replaced  by  the  complex  field  C.  Here,  all  the  functions  /,  are  defined 
on  a  subset  Dj  of  Cm  x  Cn  and  take  values  in  C.  Furthermore,  a  protocol  has  a  domain 
D  c  Cm  x  Cn  and  the  message  functions  m,  and  m,  [cf.  Eqs.  (1.1)— (1.2))  are  defined  on  D. 


1.2  Related  Research. 

The  problem  formulation  we  are  using  is  due  to  Abelson  ([A  78],  [A  80])  who  established 
lower  bounds  on  one-way  and  two-way  communication  complexity,  assuming  that  the  mes¬ 
sage  functions  are  once  (respectively,  twice)  continuously  differentiable.  (These  results  are 
stated  and  discussed  in  Sections  3  and  5,  respectively.) 

Communication  complexity  has  also  been  studied  under  discrete  models  of  communica¬ 
tion.  In  these  models,  the  messages  exchanged  are  binary  and  the  functions  evaluated  are 
such  that  a  finite  number  of  binary  messages  are  actually  sufficient.  For  example,  [Y  79] 
and  [PS  82]  consider  the  computation  of  Boolean  functions  using  binary  messages.  The 
approach  in  these  references  is  combinatorial  in  nature  and  very  different  from  ours.  A 
fair  amount  of  research  has  dealt  with  extensions  of  the  results  of  [Y  79]  and  with  the 
evaluation  of  the  communication  complexity  of  selected  combinatorial  problems  ([AU  83], 
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[MS  82],  [PE  86],  [PS  82],  [PT  82],  [U  84]).  A  different  framework  is  considered  in  [TL  87] 
for  the  problem  of  approximately  minimizing  (within  a  desired  accuracy)  the  sum  of  two 
convex  functions,  with  each  function  known  by  a  different  processor.  Here,  the  objective  is 
to  minimize  the  number  of  binary  messages,  as  a  function  of  the  desired  accuracy  of  the 
solution. 

1.3  Outline  of  the  Paper. 

The  rest  of  this  paper  is  organized  as  follows.  In  Section  2,  we  present  some  background 
results  from  field  extension  theory  that  will  be  used  in  our  study  of  one-way  communication 
complexity. 

In  Section  3,  we  study  the  one-way  communication  complexity  of  computing  a  set 
/i,  •••)/«  of  polynomials.  The  results  of  [A  78]  (stated  in  Subsection  3.1)  provide  a  complete 
solution  for  the  case  of  a  single  function  /,  smooth  message  functions,  and  protocols  whose 
domain  is  a  (possibly  very  small)  open  set.  We  extend  these  results  to  the  case  of  e  >  1. 
We  also  show  that  we  can  restrict  to  the  class  of  polynomial  protocols  while  increasing 
the  communication  complexity  by  at  most  one.  Furthermore,  the  polynomial  protocols  we 
construct  have  a  domain  which  is  almost  all  of  9?m  X  9?"  (except  for  a  set  of  measure  zero). 
We  also  consider  the  special  case  where  m  =  n  and  each  one  of  the  polynomials  /,  :  9?"  x  9?n 
is  of  the  form  /,-(*,  y)  =  /<(x  +  y),  where  each  is  a  polynomial  in  n  variables.  For  this 
case,  we  obtain  a  complete  characterization  of  the  communication  complexity,  a  proof  that 
linear  protocols  are  optimal,  and  a  constructive  procedure  for  designing  such  protocols. 

In  Section  4,  we  present  some  background  from  algebraic  geometry  (e.g.  Hilbert’s  Null- 
stellensatz)  that  will  be  needed  later. 

In  Section  5,  we  derive  several  general  lower  bounds  on  two-way  communication  com¬ 
plexity  of  computing  a  rational  function  /  when  the  messages  are  constrained  to  be  rational 
functions  of  the  data.  Our  results  are  obtained  by  combining  an  earlier  result  of  Abelson 
[A  80]  with  the  tools  of  Section  4.  We  also  identify  certain  instances  where  the  lower  bounds 
of  [A  80]  are  tight. 

In  Section  6,  we  apply  the  results  of  Section  5  to  the  problem  of  computing  a  particular 
entry  of  the  inverse  of  x  +  y,  where  x  and  y  are  n  x  n  complex  matrices.  We  derive  an  n2  -  1 
lower  bound  (which  agrees  with  the  obvious  upper  bound,  within  one  message),  while  the 
results  of  [A80]  could  only  provide  an  fi(n)  lower  bound. 
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2  Preliminaries. 


In  this  section,  we  introduce  some  algebraic  results  (see  e.g.  [ZS  65,  pages  95-125]  or 
[VW  53])  that  will  be  needed  in  Section  3. 

Notation:  Let  {a,  :  i  6  /}  be  a  collection  of  vectors  in  3Rn,  where  I  is  a  finite  index  set. 
We  use  [a,  :  »  €  I]  to  denote  the  matrix  with  columns  a,,  t  £  I.  Whenever  the  range  of  the 
index  variable  i  (that  is,  the  index  set  7)  is  evident  from  the  context,  we  use  the  simpler 
notation  [a*  :  »'].  For  any  function  /  :  SR”  •— *  3?,  we  use  V/  to  denote  the  vector-valued 
function  whose  components  are  the  partial  derivatives  of  /.  We  also  use  V/(p)  to  denote 
the  value  of  V/  evaluated  at  some  p  £  SR". 

Definition  2.1  Let  F\  and  Fi  be  two  fields.  We  say  that  F 2  is  an  extension  of  F\,  denoted 
by  F2/F1,  if  F\  is  a  subfield  of  Fj.  An  element  X  £  Fj  is  said  to  be  algebraic  over  Fi  if  X 
satisfies  a  relation  /(A)  =  0,  where  f  is  a  polynomial  with  coefficients  taken  from  Fi-  We 
say  that  F 2  is  an  algebraic  extension  field  of  F\  if  all  the  elements  of  F2  are  algebraic  over 
Fi-  Otherwise,  we  say  that  F2  is  a  transcendental  extension  field  of  F\. 

Let  F\  be  a  subfield  of  some  field  F.  A  typical  way  of  constructing  an  extension  field  of 
Fi  is  by  adjoining  to  F\  some  elements  A,  £  F  that  do  not  belong  to  Fj  (1  in  some  index  set 
A).  Consider  the  set  of  all  subfields  of  F  that  contain  F\  and  A,  («  £  A).  The  intersection 
of  all  of  these  fields  is  still  a  field  and  is  the  smallest  field  containing  Fi  and  the  A*’s.  This 
field  is  called  the  field  generated  by  the  A/s  and  will  be  denoted  by  F2  —  Fi({A,, «  £  .£}). 
When  the  cardinality  of  A  is  finite,  we  say  that  F2  is  a  finitely  generated  extension  field  of 

Fi 

Definition  2.2  We  say  that  F2  is  a  finite  algebraic  extension  of  the  field  F\,  if  the  extension 
Fj/Fj  is  algebraic  and  the  dimension  of  Fj,  when  regarded  as  a  vector  space  over  F\,  is  finite. 

Definition  2.3  Let  F2/F1  be  a  finite  algebraic  extension  and  let  X  be  an  element  of  F2. 
We  say  that  X  is  a  primitive  element  of  the  extension  F2/F1  if  F2  =  Fi(A),  i.e.,  if  F2  is 
generated  by  X  over  the  field  F|.  In  this  case,  we  say  that  Fj  is  a  simple  extension  of  Fj. 

The  notion  of  a  finite  algebraic  extension  is  different  from  the  notion  of  a  finitely  generated 
extension.  For  example,  5R(x)  is  a  finitely  generated  field  over  8?  but  not  a  finite  algebraic 
extension  since  3?(x)/9i  is  a  transcendental  extension.  However,  the  following  theorem  states 
that  this  is  the  only  type  of  counterexample  (see  [ZS  65,  pages  60-61]). 

Theorem  2.1  Every  finitely  generated  algebraic  extension  is  finite. 
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Definition  2.4  Let  F  be  a  field  and  let  F\x\  be  the  ring  of  polynomials  with  coefficients  in 
F.  The  differentiation  operator  ^  is  the  mapping  of  F[x]  into  itself  defined  in  terms  of  the 
following  properties: 


\i= 0 


where  n  >  0  and  a,  £  F,  i  =  0, . . . ,  n. 


We  note  that  when  F  is  equal  to  !R,  the  above  definition  coincides  with  the  usual  notion  of 
differentiation. 

Definition  2.5  Let  F2/F1  be  an  algebraic  extension  and  let  X  be  an  element  in  Fz-  The 
minimal  polynomial  of  X  is  a  polynomial  f  £  Ei\x]  of  the  smallest  degree  such  that  /  (A)  =  0. 
The  element  A  is  called  separable  over  Fi  if  there  holds  (a?(/))  (A)  ^  0,  where  f  is  the 
minimal  polynomial  of  X.  F2  is  called  a  separable  algebraic  extension  if  all  the  elements  of 
F2  are  separable  over  F\. 

The  following  result  (see  e.g.jZS  65,  page  84])  is  called  the  theorem  of  primitive  element 
and  will  be  used  in  Section  3. 


Theorem  2.2  Every  finite  separable  algebraic  extension  Fj/Fi  has  a  primitive  element. 
Hence,  every  such  extension  is  a  simple  extension.  Furthermore,  if  F2  =  F\[ Ai,...,A*), 
then  there  exists  a  primitive  element  of  the  form  X  =  X);  =  i'7>A>  where  7;  £  F\  for  each  j. 

Remark:  In  fact,  the  proof  of  Theorem  2.2  given  in  [ZS  65,  page  84]  shows  that  a  primitive 
element  A  is  obtained  for  any  arbitrary  choice  of  the  coefficients  71, ...  ,7*  as  long  as  they  do 
not  lie  in  the  zero  set  of  a  certain  polynomial.  As  an  illustration,  notice  that  3?(l+«,  1  —  *')  =  C 
is  a  finite  separable  algebraic  extension  over  5R.  By  Theorem  2.2,  there  exists  a  primitive 
element  which  can  be  taken  as  a  linear  combination  of  1  +  « ,  1  —  ».  In  particular,  one  has 
5R(1  +  i,l-  1)  =  SR(7i(1  +  t)  +  72(1  -  *))  for  some  suitable  choices  of  real  numbers  71,72. 
It  is  not  hard  to  see  that  all  of  the  fields  3?(7i(1  +  «)  +  72(1  -  «*))  are  equal  to  C,  as  long  as 
'll  =£  'll- 

We  now  turn  our  attention  to  the  case  of  transcendental  extensions. 

Definition  2.6  Let  Fz/F\  be  a  field  extension.  The  transcendental  degree  of  F2/F1  is 
defined  as  the  smallest  number  t  such  that  there  exist  elements  Aj,  A2, . . . ,  At  in  F2  with  the 
property  that  F2  is  an  algebraic  extension  of  Fi(Xi,X2,  ...  ,Xt).  In  particular,  if  F2/F1  is 
an  algebraic  extension  to  start  with,  then  its  transcendental  extension  degree  is  zero.  The 
transcendental  degree  will  be  denoted  by  tr.d .F2/F1  and  the  elements  Ai,A2,...,At  will  be 
called  a  transcendental  basis  of  F2/ F\. 
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In  light  of  the  above  definition,  9?(zi ,  zi,  ■ . . ,  xm )  (the  field  of  rational  functions  over  5R 
with  indeterminates  xi,X2,  ■  ■  ■  ,  xm)  is  a  transcendental  extension  of  5R  with  degree  m  and 
11,12, ...  ,xm  can  be  taken  as  a  transcendental  basis.  The  following  theorem  summarizes 
some  important  properties  of  the  transcendental  degree  of  a  field  extension. 

Theorem  2.3  Let  Fj  be  a  finitely  generated  extension  field  of  F\  and  let  F$  be  a  finitely 
generated  extension  field  of  F 2.  (In  particular,  F3  is  also  a  finitely  generated  extension  field 
of  F\.)  Suppose  that  F$  =  Fi( Ai,  A2, . . . ,  An)  and  that  tr.d.Fs/Fj  =  t.  Then, 

t  —  tr.d-.F3/ F\  —  tr.d.F’3/ F2  +  tr.d  .F2/F1. 


The  following  is  the  definition  of  a  derivation  over  a  field,  which  is  a  generalized  notion 
of  differentiation. 

Definition  2.7  Let  F2  be  a  finitely  generated  extension  field  of  F\  and  let  F3  be  an  extension 
field  of  Fj.  A  mapping  D  of  F2  into  F3  is  said  to  be  an  F\-derivation  of  Fj  (with  values  in 
F3)  */<  for  every  A  in  Fi  and  every  x,  y  in  F2,  the  mapping  D  has  the  following  properties: 

1.  D(X)  =  0; 

*■  D(x  +  y)  =  D(x)^D(y); 

S.  D(xy)  =  xD(y)  -f  y D(x). 

Notice  that  the  derivations  are  defined  in  a  way  that  is  very  similar  to  differentiations. 
As  a  result,  one  can  show  that  the  well  known  chain  rules  remain  true  for  derivations.  We 
now  let  Pf3/Fi(Fz)  stand  for  the  space  of  all  Fi -derivations  of  F2  with  values  in  F3.  Then 
pFj/Fii^s)  can  viewed  as  a  vector  space  over  F$  in  a  natural  way  since  one  can  easily 
verify  that  Df3/f,(F 3)  is  closed  under  linear  combinations  over  F3.  It  ran  be  shown  (see 
[ZS  65,  pagesl2Q-127j)  that  the  dimension  of  the  vector  space  Pf7/Fi(^3)  d°es  not  depend 
on  the  particular  choice  of  Fj.  It  is  for  this  reason  that  we  usually  drop  Fj  from  the  notation 
Dpt/Fti?' s)  a°d  use  simply  Pf3/fx  to  denote  the  space  of  Fi-derivations  of  F2  with  values 
in  any  extension  field  of  F2. 

Definition  2.8  Let  F  be  a  field  whose  multiplication  identity  is  denoted  by  e.  IfY,?=  1  e  0 
for  all  positive  integers  n,  we  say  that  F  has  characteristic  0. 

For  example,  the  fields  9?  and  C  have  characteristic  0.  In  fact,  every  extension  field  of  9? 
has  characteristic  0  since  it  shares  the  same  identity  element  with  9?.  The  following  result 
is  quoted  from  [ZS  65,  page  125], 
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Theorem  2.4  Let  F2  be  a  finitely  generated  extension  field  of  F\  and  let  F3  be  a  finitely 
generated  extension  field  of  F2.  If  F2  has  characteristic  zero,  then  each  derivation  D  £ 
Dfi/Fx  can  be  extended  to  a  derivation  D  in  PF3/Fr 

Example:  We  now  consider  in  some  detail  the  space  of  derivations  for  an  important  spe¬ 
cial  case  and  derive  a  result  that  will  be  needed  in  Section  3.  Let  Fi  =  and  let  F3  — 
3? (x  1 ,  X2, .  ■  ■  xm),  the  field  of  rational  functions  over  3?  with  indeterminates  x\,  X2,  ■  ■  . ,  xm. 
Furthermore,  we  let  F2  be  the  subfield  of  F3  which  is  generated  by  polynomials  /1,  ft, . . . ,  fn  £ 
F3.  In  other  words,  F2  is  the  set  of  all  rational  functions  that  can  be  expressed  as  rational 
functions  of  the  fj’s.  It  can  be  readily  verified  that  the  partial  derivatives  defined  by 

fc)  (l;)  =  6>k' 

are  in  Pf3/f,  (F3),  where  Sjk  is  the  Kronecker  delta.  This  implies  that  for  any  D  £  £>f3/F1(F 3) 
the  derivation  ( D  -  £r=i  $—)  maps  xl,...,xm  to  zero.  Hence  it  maps  F3  to  zero. 

In  other  words,  we  have 

m  o 

D  =  EDM^r- 

Hence  D  is  completely  determined  by  the  choices  of  D(x *)  £  F3,  k  =  l,2,...,m,  and 
•  •  •  >  is  a  basis  for  ^f3/fi(f3)-  Now  suppose  that  D  £  Pf2/f1{fs)-  Since  F2 
has  characteristic  0,  by  Theorem  2.4,  we  see  that  D  can  extended  to  a  derivation  D  in 
(F3).  From  the  above  discussion,  we  see  that 

°  =  (2.1) 
k=l  dx * 

Therefore,  the  map  D,  which  is  equal  to  the  restriction  of  D  on  F2,  can  be  written  as  a  linear 
combination  of  the  gf-’s  (cf.  Eq.  (2.1)).  Conversely,  for  each  choice  of  £>(x*)  £  F$,  Eq. 
(2.1)  defines  a  derivation  in  Pf^/F^Fs)-  However,  two  different  choices  of  D( x*)  may  give 
rise  to  the  same  derivation  in  Pf2/Fi{F s)-  As  a  matter  of  fact,  any  /  £  F2  can  expressed  in 
the  form  of  /  =  g{fi,  /2, . . . ,  /„),  where  g (zj,  z2, . . .  z„)  is  a  rational  function.  By  the  chain 
rule,  we  have 

DU)  =  £m)+§lDlh)  +  -+§lD(M, 

where  is  the  partial  derivative  of  g  with  respect  to  Zj  defined  in  the  usual  sense.  Since 
the  ’s  are  independent  of  D,  we  see  ihat  D  is  completely  determined  by  its  operation 
on  /;  >  J  =  1, 2, . . . ,  n.  Moreover,  since  the  /; ’s  belong  to  F2  we  see  that  different  choices  of 
the  D{f}),s  will  result  in  different  derivations  in  Pf^/f^ 
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We  now  develop  an  explicit  formula  for  the  dimension  of  (Eq.  (2.4)  below),  in  the 

context  of  the  particular  example  we  have  been  considering.  This  formula  will  be  crucial 
for  the  results  of  Section  3. 


Notice  that  for  every  j  and  any  D  G  Df2/Fi<  one  has 

DUi)  = 


We  now  rewrite  Eq.  (2.2)  in  the  matrix  form 

D(/2), ....  D{fn))  =  [V/,-  :  j  e  </], 

where  J  =  {1,2,,..,  n}.  Since  Z?(x*)  can  be  taken  arbitrarily,  we  see  that  the  vector  space 
^F2/ Flip's)  is  isomorphic  to  the  space  spanned  by  the  rows  of  the  matrix  [V /;  :  jj.  Hence 

dim DFa/Fl  =  rank[V/;-  :  j),  (2.3) 

where  the  entries  of  [V/y  :  j]  are  polynomials  in  the  variables  x1;X2, . . . ,  xm  and  the  rank 
is  evaluated  in  the  field  F3. 

An  alternative  formula  for  dimPfj/f,  is  obtained  as  follows.  We  can  assign  real  values 
to  ii,  i2, . . . ,  xm  and  calculate  the  rank  in  3?.  Consider  the  matrix  [V fj(p)  :  j]  which  is  the 
matrix  [V/y  :  j]  evaluated  at  the  point  p  €  5Rm.  Suppose  the  maximum  (over  all  p)  rank  of 
[V/;(p)  :  j]  is  r.  Then  there  must  exist  some  p  G  9?m  and  some  submatrix  of  [V/;(p)  :  j] 
of  dimensions  r  x  r  whose  determinant  is  nonzero.  Consider  the  determinant  (in  F3)  of  the 
corresponding  submatrix  of  [V /;  :  j\.  This  determinant  is  a  polynomial  which,  according 
to  the  above  discussion,  does  not  vanish  at  p.  Therefore,  this  determinant  is  a  nonzero 
polynomial.  Consequently,  the  rank  of  [V /)•  :  j)  (viewed  as  a  matrix  of  elements  of  F3)  is 
greater  than  or  equal  to  r.  By  reversing  this  argument,  we  also  see  that  r  is  no  less  than 
the  rank  of  [V  /;  :  j] .  Hence 

max  rank([V/;  (p)  :  j})  =  rank([V/y  :  ;:). 

Combining  this  with  Eq.  (2.3),  we  obtain  the  following  basic  result: 

dim Pf2/fl  =  max  rank([V/;(p)  :  j]).  (2.4) 

Pfc 


(s  "O'" 

(D(x1),JD(x2),...,D(xm))v/J, 


(2.2) 


We  close  this  section  with  a  result  which  relates  the  transcendental  extension  degree 
and  the  dimension  of  the  associated  space  of  derivations  (see  [ZS  65,  page  125- 127)) . 
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Theorem  2.5  Let  F\  be  a  field  and  let  Fi  be  a  finitely  generated  extension  field  of  F\  such 
that  tr.ci.F2/F1  =  d  and  dim Df3/f1  =  t.  Then  t  is  equal  to  the  smallest  number  r  such 
that  there  exist  elements  Ai,A2,...,Ar  with  the  property  that  F2  is  separable  algebraic  over 
Fi(Ai,  •  •  •  >  Ar).  particular,  t  >  d.  Furthermore,  if  F\  has  characteristic  0,  then  the 
equality  t  =  d  holds. 


3  One-Way  Communication  Complexity. 

In  this  section,  we  study  the  one-way  communication  complexity  of  evaluating  a  set  / 1 , . . . ,  f, 
of  polynomials,  when  the  messages  transmitted  are  restricted  to  be  polynomial  functions  of 
the  data.  We  apply  the  tools  of  field  extension  theory  (presented  in  Sectic:.  2)  to  obtain  a 
bound  for  the  communication  complexity  which  is  almost  optimal  (within  one  message).  It 
will  be  seen  that  our  results  strengthen  earlier  results  in  a  number  of  directions.  We  also 
show  that  the  restriction  to  polynomial  protocols  can  increase  the  communication  com¬ 
plexity  of  the  problem  by  at  most  one  message.  We  then  specialize  to  the  case  where  the 
polynomials  f,  to  be  evaluated  are  of  the  form  fj(x,y)  =  fj[x  +  y),  for  some  functions 
and  we  show  that  there  exist  optimal  protocols  with  a  very  simple  structure:  they  consist 
of  messages  which  are  linear  functions  of  the  data. 

3.1  General  Results. 

The  main  available  result  on  one-way  protocols  is  due  to  Abelson  [A  78]  :3 
Theorem  3.1  Let  f  :  5Rm  x  5Rn  >— »  1R  be  an  infinitely  differentiable  function. 

a)  Let  D  be  a  subset  0}  5Rm  x  !Rn.  There  holds  Coo[f  ',D)  <  r  if  and  only  if  there  exist 

infinitely  differentiable  functions  mi,m2, .  ■ .  ,mr  :  5i”  * — ►  !R  and  h  :  9?r+n  >-*  9t  such 

that 

f{x,y)  =  My,mi(z),m2(x),...,mr(x)),  v(x>y)e£>.  (3.1) 

b)  Let  (z*,y*)  be  some  element  of  l Rm  x  3Rn.  There  exists  some  open  set  D  C  Dtm  x  5R" 

containing  (z*,y‘)  for  which  Coo{f  \  D)  <  r  if  and  only  if 

dim  (span{yiiI>,y2ri',---,Pm>i*})  <  r,  (3.2) 

where  g<t  x.(y)  =  and  where  the  span  is  taken  in  the  vector  space  of  functions 

of  y  defined  on  an  open  set  containing  (z*,y*). 

3  We  »tate  thie  reeult  for  the  class  n<x>(/;D)  of  protocols  that  use  infinitely  differentiable  functions.  The 
result  was  actually  proved  in  [A  78]  for  the  class  fli (f;D)  but  the  proof  remains  valid  for  Tlao{f  ',D). 
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Let  us  consider  protocols  whose  domain  D  is  all  of  !Rm  x  SR".  By  varying  (z*,y*)  over 
all  possible  elements  of  3?m  x  SRn  and  applying  part  (b)  of  the  theorem  to  each  one  of  these 
points  we  obtain 


Coo (/;9im  x  S?n)  >  max  dim  (span{yj 
z*eRm 


•  •  i 


(3.3) 


Part  (b)  of  the  theorem  states  that  this  lower  bound  is  also  tight  in  a  local  sense:  there  exist 
protocols  whose  number  of  messages  equals  the  lower  bound  and  which  evaluate  /  correctly 
when  (z,y)  is  restricted  to  a  suitably  small  domain  D.  However,  nothing  can  be  inferred  on 
the  tightness  of  this  bound  when  one  considers  protocols  whose  domain  is  all  of  5Rm  x  3?n. 
Furthermore,  the  message  functions  m,  in  Eq.  (3.1)  are  not  guaranteed  to  be  polynomials, 
even  if  the  function  /  is  a  polynomial.  Both  of  these  deficiencies  will  be  remedied  in  the 
sequel. 

Throughout  this  section,  we  assume  that  we  are  dealing  with  a  given  set  /  =  {A,  •  ■  • ,  /.«} 
of  polynomial  functions  mapping  3?m  x  9?n  into  5R  and  that  only  one-way  protocols  are 
considered.  We  start  by  proving  a  lower  bound  similar  to  Theorem  3.1(b),  but  more  general, 
because  Theorem  3.1  dealt  only  with  the  case  8  =  1. 

Notation:  For  i  =  1, . . . ,  s,  and  for  any  set  a  =  (aj, . . . ,  an)  of  nonnegative  integer  indices, 
we  define  a  function  y?  :  SRm+n  SR  by  letting 


s“(*> v) = 


dafi 


dy*'dy?---dyZ 


-(*.!/)• 


(3.4) 


(We  use  the  convention  g°  =  ft.)  Let  A  be  the  set  of  all  a  such  that  y“  is  not  identically 
zero  for  some  t.  (Clearly,  A  is  a  finite  set,  since  each  /,  is  a  polynomial.)  For  any  function 
g(x,y)  :  x  5Rn  ■— ♦  5R,  we  use  Vzg  to  denote  the  vector-valued  function  of  dimension  m 

whose  components  are  the  partial  derivatives  of  g  with  respect  to  the  first  m  coordinates. 


Theorem  3.2  Let  D  be  some  open  subset  of  SR™  x  5Rn. 

a)  If  Coo{f  \  D)  <  r,  then  there  exist  infinitely  differentiable  functions  mi, . . . ,  mr  :  5Rm  >-+ 
!R  and  hf  :  !Rr+n  *-*  5R,  i  =  1, . . . a  €  A,  such  that 

9?{z,y)  =  Kit/," Ji(z),...,mr(z)),  V(z,y)€Z?,  (3.5) 


b)  There  holds 

Coo(f ;  D)  >  max  rank [V2y,a(z,y)  : «  =  1, 2, . . . ,  s;  a  €  A]. 
(*.v)e£> 


(3.6) 
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Proof:  a)  Since  Coo(7;  D)  <  r,  there  exist  infinitely  differentiable  functions  mi, . . . ,  mr  and 
<7i, . .  .g,  such  that 

fi(x,y)  —  hi(mi[z),...,mr(x),y),  V(z,y )  e  D, 

We  differentiate  both  sides  of  this  equation,  with  respect  to  y.  The  left-hand  side  yields 
y®(z,  y).  The  right-hand  side  remains  an  infinitely  differentiable  function  of  m;(z),  j  = 
1, . . .  ,r  and  y,  and  h®  can  be  taken  equal  to  that  function. 

b)  Suppose  that  Coo(/;  D)  =  r.  Then  Eq.  (3.5)  holds  for  some  suitable  functions  hf  and  for 
all  (z,y)  €  D.  By  differentiating  both  sides  with  respect  to  z,  we  obtain 

Vz0,“(:r,y)  =  Vzm*(z),  V(z ,y)  e  D,  Vi i.  (3.7) 

Thus,  each  column  of  the  matrix  [Vzy®(z,y)  :  t  =  1,2, . . .  ,s;  o  €  /l]  is  a  linear  combination 
of  the  vectors  VIm1(z),. . . ,  Vzmr(x).  It  follows  that  the  rank  of  that  matrix  is  at  most  r 
for  every  (x,y)  €  D.  Q.E.D. 

We  now  notice  that  any  polynomial  fi  can  be  written  in  the  form 

/<(*.  »)=  H  /ia(x)y?‘y2“2---yn".  (3-8) 

(“i . On)G^ 

where  each  fi*  is  a  suitable  polynomial.  By  differentiating  both  sides  of  (3.8),  setting  y  =  0, 
and  comparing  with  Eq.  (3.4),  we  see  that  for  each  t, a,  there  exists  a  positive  constant  Cja 
such  that 

/<«(*)  =  ciag?{x, 0),  Vz  e  Vtm.  (3.9) 

Let  us  define 

t  =  max  rank[V/ia(z)  : »  =  1, . .  .a; a  €  A]  (3.10) 

Using  Eq.  (3.9)  we  see  that 

t  =  max  rank  [Vxy®(z,0)  :  i  =  1,2, ...  ,s;  a  €  A] 

<  ^  m^ax  ^  rank[Vzy®(z, y)  :  i  =  1,2,..  .,s;a  €  A].  (3.11) 

Corollary  3.1  Cpotv{J;  5Rm  x  lRn)  >  Coo(/;!Rm  x  3?")  >  t. 

Proof:  The  first  inequality  is  trivial  since  we  are  considering  a  restricted  class  of  protocols. 
The  second  follows  from  (3.6)  and  (3.11).  Q.E.D. 

We  make  a  short  digression  to  verify  that  the  bound  t  of  Corollary  3.1  is  a  generalization 
Theorem  3.1. 
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Theorem  3.3  For  the  ease  s  =  1,  that  is,  for  the  problem  of  computing  a  single  polynomial 
/(*» »)  =  5Za€A  f<*{x)yi'  "  ’ Vnn >  value  oft  is  equal  to  the  right-hand  side  of  Eq.  (S.S). 

Proof:  Let  us  fix  some  x*  e  9?m.  Let  r(x*)  be  the  dimension  of  the  span  of  {  J^:(x*,  y),j  = 
1, . . .  ,  m},  where  the  span  is  formed  in  the  vector  space  of  functions  of  the  variable  y.  We 
only  need  to  show  that  maxj.gsn  r(x*)  =  t.  Notice  that 

V,/(x*)  y)  =  £  V./a(x*) »?‘yj*  •  •  •  y“». 

a£A 

Using  the  definition  of  r(x'),  we  see  that  there  exist  m  -  r(x‘)  linearly  independent  vectors 
Pi,  pi, . . . ,  Mm-r(z')  *n  that  are  orthogonal  to  Vr/(x*,y)  for  all  y.  This  is  clearly 
equivalent  to 

pj^z fa{x‘)  =  0,  Vq,  i, 

and  implies  that  rank[Vx/a  :  a  E  A]  <  r(x*).  Taking  the  maximum  over  all  x‘,  we  have 
t  <  r(x*).  The  proof  of  the  reverse  inequality  is  just  the  reverse  of  the  preceding  argument. 
Q.E.D. 

We  now  come  to  the  main  result  of  this  section  which  shows  that  the  lower  bound  of 
Corollary  3.1  is  quite  tight. 

Theorem  3.4  There  exists  an  open  set  Dq  C  5Rm  whose  complement  has  Lebesgue  measure 
zero  and  such  that  Cpoiy[f  \  Do  x  5R")  <  t  +  1. 

Proof:  We  will  show  the  existence  of  an  open  set  Do  and  of  a  set  of  polynomial  message 
functions  mi,  m2, . .  .,mt+i,  such  that  each  /,Q  can  be  expressed  in  the  form 

/•cr(x)  =  /iia(mi(x),...,mt+i(x)),  Vx  G  Do,  (3.12) 

where  hta  is  a  suitable  rational  function.  In  light  of  Eq.  (3.8),  processor  P2  is  able,  upon 
receipt  of  the  messages  mi(x),m2(x), . . .  ,mt+i(x),  to  evaluate  /i(x,y)  for  each  and  this 
will  prove  that  Cpoiv(/;  Do  x  5Rn)  <  t  +  1,  as  desired. 

Let  Fi  =  SR  (the  field  of  real  numbers).  Let  F$  =  Fi({/,a})  be  the  field  generated  by 

the  polynomials  {/,„  :»  =  1 . »;a£^)  over  F%.  Since  Fi  has  characteristic  0  and  F3/F1 

is  finitely  generated,  Theorem  2.5  applies  and  shows  that 

tr.d.Fs/Fi  =  dimDFt/Fl.  (3.13) 

Notice  that  we  are  dealing  with  the  situation  considered  in  the  example  of  Section  2.  In 
particular,  Eq.  (2.4)  shows  that 

dimPf3/Fl  =  max  rank [V/*Q(x)  : »  =  1, . . . ,  s;  a  G  A].  (3.14) 
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By  comparing  with  Eq.  (3.10),  we  see  that  t  =  dimPfi/Fl  and  using  Eq.  (3.13),  we  obtain 

t  =  tr.d.F3/Fj. 


Let  us  choose  a  set  of  indices  such  that 

t  =  max  rank[V/,iai(i),...,V/„a,(i)], 

and  let  F2  stand  for  the  field  generated  by  ••>/«, a,  over  F\.  By  repeating  the  argu¬ 

ment  in  the  preceding  paragraph,  we  obtain  t  =  dim =  tr.d.F2/Fi.  We  then  invoke 
Theorem  2.3  to  obtain 


t  =  tr.d.F3/Fi  =  tr.d.F2/Fi  +  tr.d  .F3/F2  =  t  +  tr.d.F3/F2) 
which  shows  that  tr.d.F3/F2  =  0. 

We  notice  that  F3  is  a  finitely  generated  extension  of  F2,  and  F2  clearly  has  characteristic 
zero.  Therefore,  we  are  in  a  position  to  apply  Theorem  2.5  to  F3/F;  to  conclude  that  F3/F2 
is  a  separable  algebraic  field  extension.  By  Theorem  2.1,  F3/F2  is  also  a  finite  algebraic 
extension.  We  can  therefore  apply  the  theorem  of  primitive  element  (Theorem  2.2)  to 
F3/F2.  This  leads  to  the  conclusion  that  F3  =  F2(/’)  where  /*  is  some  linear  combination 
(over  the  field  F2)  of  the  polynomials  {/fQ  :  (»,a)  ^  (»*,a*),  Vfc.}.  More  precisely, 

/*  =  ££<, a/, a,  (3-15) 

oeA i=i 

where  each  £ja  is  an  element  of  F2  and  where  (ikak  =  0  for  k  =  1, . . . ,  t.  In  particular,  using 
the  definition  of  F2,  each  e,a  can  be  expressed  as  a  rational  function  of  , . . . , 

Since  F3  =  F2(/*)  =  Fi  (/,,  a /,,a  ,,/*),  it  follows  that  each  /,a  can  be  expressed  as 
a  rational  function  of  the  functions  fi,a, ,  /*.  Thus,  there  exist  rational  functions 

hie  such  that 

fia  =  ^ta(/iii»|i  •  •  •  /  )  (3.16) 

Note  that  (3.16)  is  similar  to  (3.12)  except  that  it  refers  to  the  equality  of  two  elements  in 
Fs  and  that  /*  need  not  be  a  polynomial.  Let  5  be  the  set  in  8?m  on  which  the  denominator 
of  some  of  the  rational  functions  under  consideration  vanishes.  The  set  S  has  measure  zero. 
Let  us  denote  the  complement  of  S  by  Do-  Clearly,  Do  is  an  open  set.  By  evaluating  both 
sides  of  Eq.  (3.16)  at  an  arbitrary  vector  x  e  Do,  Eq.  (3.12)  is  obtained,  provided  that  we 
can  replace  /*  by  a  polynomial. 

To  see  that  /*  can  be  replaced  by  a  polynomial,  we  recall  the  representation  (3.15)  of  /*. 
Since  each  is  a  rational  function  of  /«,<»»>  •  •  • ,  fi,a,,  the  function  /*  can  be  expressed  as 
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the  ratio  of  two  polynomials,  /*  =  p/q,  where  q  is  a  common  multiple  of  the  denominators 
of  each  one  of  the  rational  functions  €;a.  It  follows  that  q  is  a  polynomial  function  of 
/«,<»!>  •••»/«, a,-  Let  us  consider  the  one-way  protocol  defined  by  m*  =  fikak,  k  =  1 
and  fht+1  =  /*.  Then,  q  is  known  to  a  processor  who  has  already  received  the  values  of 
•  •  •  >  fitar  Consequently,  transmitting  the  value  p(x)  (as  the  last  message)  carries  the 
same  information  as  transmitting  the  value  /*(x).  We  have  therefore  constructed  a  one-way 
protocol  (with  m*  =  /,kak,  h  —  1,...,*,  and  mt+i  =  p)  which  uses  t  +  1  messages,  and 
all  messages  are  polynomial  functions  of  the  input  x.  Furthermore,  by  Eq.  (3.16)  and  the 
fact  that  q  is  a  polynomial  function  of  mi, . . .  ,  m*,  we  see  that  Eq.  (3.12)  holds  for  some 
suitable  rational  functions  hia .  Q.E.D. 

In  order  to  turn  Theorem  3.4  into  a  useful  result,  one  needs  a  computationally  effective 
method  for  evaluating  t'  and  for  constructing  a  protocol  that  uses  t  +  1  messages.  The 
solution  to  this  problem  is  not  apparent  and  depends  on  the  structure  of  the  field  F3. 
However,  our  proof  does  suggest  a  randomized  procedure,  which  we  now  outline.  Assuming 
that  the  number  of  functions  f,a  is  not  excessive,  we  can  evaluate  the  rank  of  the  matrix 
consisting  of  the  gradients  Vx/ia  at  a  random  point.  Obviously,  except  for  a  closed  set  of 
zero  measure  (an  algebraic  set)  we  will  find  the  maximum  rank  t,  as  well  as  polynomials 
/•jau  •••>/», a,  with  the  desired  properties.  Moreover,  according  to  *  le  remark  following 
Theorem  2.2,  we  know  that  the  overwhelming  majority  of  choices  of  the  coefficients  c,Q  in 
Eq.  (3.15)  are  acceptable. 

To  summarize  the  results  in  this  subsection,  we  have  shown  that  (as  long  as  we  are 
willing  to  disregard  a  set  of  points  of  measure  zero)  the  restriction  to  polynomial  messages 
can  increase  the  communication  complexity  by  at  most  one.  This  is  in  contrast  to  the  earlier 
results  (Theorem  3.1)  that  asserted  the  existence  of  protocols  which  are  not  necessarily 
polynomials  and  whose  domain  is  only  some  (possibly  very  small)  open  set. 

3.2  Computing  Polynomials  of  the  Form  /( x  +  y )• 

In  this  subsection  we  consider  the  special  case  where  all  of  the  polynomials  /,  :  9?' n  x  3? n  >-+  3? 
to  be  computed  are  of  the  form 

/»(*.  y)  =  /<(*  +  »).  •  =  1,2, 

where  each  /,  :  3in  •-»  5R  is  a  polynomial. 

We  exploit  this  special  structure  and  show  that  linear  protocols  (i.e.,  the  messages  are 
linear  functions  of  the  input)  are  optimal  within  the  class  of  protocols  that  use  infinitely 
differentiable  message  functions. 
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Let,  as  in  the  preceding  subsection, 

We  view  7»  as  a  function  of  a  variable  z  £  3?"  and  we  define 


Let 


t  =  max  rank[V25.Q(z)  :  i  =  1, . . . ,  s;  a  €  A]. 

x63?" 


(3.17) 


Theorem  3.5  Rn  x  3Rn)  =  C!inear(7; SRn  x  5R")  =  t. 


Proof:  We  first  prove  a  lower  bound.  Using  Theorem  3.2(b),  we  have 
Coo(7;5Rn  x  !Rn)  >  max  rank[Vxy?(x,y);*,a]. 

(i,v)e»nxRn 

We  notice  that  y“(z)  =  y“(x,  y)  and  Vxyt°(z)  =  Vxy“(x,y),  where  z  =  x  +  y.  We  thus 
obtain 

Coo(7;^n  x  SR")  >  max  rank[Vxj®(*  +  y);  t,  a] 

(ii()£RnxS'1 

=  max  rank[V*y“(z);t,a] 

=  t, 

which  proves  the  lower  bound.  Given  that  Coo  (7;^"  X  3?n)  <  C/ir,eor (7i  3?”  x  SR"),  the  proof 
of  the  theorem  will  be  completed  once  we  establish  that  Cunta r(7>  3? n  x  SR")  <  t. 

We  first  consider  the  case  where  t  —  n.  In  this  case,  we  can  use  the  protocol  defined  by 
m*(x)  =  Xk,  k  =  1, . . . ,  n.  (That  is,  processor  Pi  transmits  its  entire  vector  to  processor 
Pj.)  This  is  clearly  a  linear  protocol  with  t  messages  and  establishes  the  desired  result  for 
the  case  t  =  n.  Notice  also  that  the  case  t  >  n  cannot  occur  since  t  is  the  rank  of  a  matrix 
with  n  rows. 

The  proof  of  the  upper  bound  for  the  general  case  (t  <  n)  proceeds  by  induction  on  n. 
For  the  basis  of  the  induction,  we  consider  the  case  where  n  =  1.  If  t  =  n  =  1,  then  the 
result  is  true,  by  the  argument  of  the  preceding  paragraph.  If  on  the  other  hand  t  =  0,  then 
Vxfi“(z)  =  0  for  all  z  €  SR  and  all  t,  a.  By  letting  a  =  (0,0, . . .  ,0),  we  see  ttiat  Vx/j(z)  =  0 
for  all  z  and  i.  Therefore,  each  7<  is  a  constant  function.  In  this  case,  processor  Pi  can 
compute  fi(x,  y)  for  each  »,  without  receiving  any  messages,  and  Cunear(f  ,  9?"  x  5Rn)  =  0  =  t, 
as  desired. 


18 


We  now  assume  that  the  result  has  been  proved  for  n  -  1  (n  >  2)  and  we  prove  it  for  n 
as  well.  The  case  t  =  n  has  already  been  dealt  with  and  we  assume  that  t  <  n. 


Lemma  3.1  If  t  <  n,  then  there  exists  a  nonzero  vector  c  =  (ci,  c2, . . . ,  cn )  G  9?"  such  that 


X>4f(*)  =  0,  Vi,*.  (3.18) 

i= l  az> 


Proof:  The  left  hand  side  of  Eq.  (3.18)  is  a  polynomial,  therefore,  it  suffices  to  show  that 
the  coefficient  corr -^ponding  to  each  term  zf1 z%*  •  ■ • z%n  is  identically  zero.  Let  us  denote 
the  coefficient  corresponding  to  the  term  z“‘z“ 2  *  *  •  z°"  of  dfijdz y  by  da(tj).  Then  Eq. 
(3.18)  becomes  equivalent  to  l  cjda[ij)  =  0  for  all  «  and  q. 

Let  H(z)  =  [Vzh“ (z);t  =  1, a  €  A],  and  consider  the  matrix  H{ 0).  Note  that  the 
column  of  H( 0)  corresponding  to  indices  «,  a,  is  equal  to 

a!  (da(iT),da(t2),...,dQ(«n)), 

where  a!  =f  aila^!  •  •  •  an!.  (This  is  because  the  terms  corresponding  to  a'  /  a  are  either 
washed  out  by  the  differentiations  or  are  set  to  zero  when  we  let  z  =  (0,0, . . .  ,0).)  We 
have  ranktf(0)  <  max,6s-.  rank# (z)  =  t  <  n.  Therefore,  there  exists  a  nonzero  vector 
c  =  (ci, . . . ,  c„)  G  5Rn  which  is  orthogonal  to  each  one  of  the  columns  of  H( 0).  This  implies 
that  Cjda(ij)  =  0  and  concludes  the  proof  of  the  lemma.  Q.E.D. 

Without  loss  of  generality,  we  assume  that  c„  ^  0,  where  c„  is  the  last  coordinate  of 
the  nonzero  vector  c  given  by  Lemma  3.1.  We  define  an  invertible  linear  transformation 
T  :  5Rn  JRn  by  means  of  the  formula 


Tz  =  (zi  +  ClZ„,Z2  +  c2zn,...,zn_i  +  C„-lZn,CnZ„). 

We  will  show  that  this  coordinate  transformation  leads  to  polynomials  that  are  independent 
of  the  last  coordinate  of  their  argument,  which  will  then  allow  us  to  use  the  induction 
hypothesis. 

Consider  the  polynomials  and  defined  by 

fi{z)  fii'I'z)  —  fi(^Zi  CiZn, .  .  .  ,  Zn-i  Cn-iZn,  CnZn^ ,  (3.19) 

/i(*,v)  =  +  (3.20) 


Using  the  chain  rule  and  Eq.  (3.18),  we  see  that 


M  =  v  dA 

dzn  fr'^dzj 


0. 
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Therefore,  the  polynomials  f-  are  independent  of  the  last  coordinate  of  their  argument  and 
can  be  viewed  as  mappings  defined  on  !Rn_1  (instead  of  SR"). 

Given  that  T  is  an  invertible  linear  transformation,  it  is  easily  seen  that  the  rank  of  the 
matrix  considered  in  (3.17)  does  not  change  if  each  /,  is  replaced  by  /?.  We  now  apply  the 
induction  hypothesis  on  the  functions  /'  =  {/[,■■■,/',}  to  conclude  that 

CMnear(7';*nX  *")<*. 

Let  the  linear  functions  mi(i), . . . ,  m](z)  correspond  to  a  linear  protocol  for  the  problem  of 
evaluating  the  functions  in  /'.  It  follows  that  there  exist  polynomials  g[, . . .  ,g't  such  that 

/,'(*, y)  =  ®<(»»i (*),••••  •"*(*),»), 

Therefore, 

/<(*,»)  =  h(x  +  y)  =  fl(T-\x  +  y))  =  fl(T-1x,T-ly) 

=  9,i{m\{T~'lx),...,m\{T~'lx),T~'ly)>  Vi,x,y, 

where  we  have  use  of  the  definitions  (3.19)  and  (3.20).  Thus,  the  functions  m\, . . .  ,mt 
defined  by  m,(x )  =  m[(T~1x),  i  =  l,...t,  define  an  one-way  protocol  for  the  problem  of 
evaluating  /i,  f2,  ■  ■  • ,  /«■  Furthermore,  each  m,  is  linear,  since  it  is  the  composition  of  linear 
functions.  Therefore,  Cnnear(f  ;  5R"  x  Sftn)  <  t.  This  completes  the  induction  and  the  proof 
of  the  theorem.  Q.E.D. 

We  remark  that  the  proof  of  Theorem  3.5  actually  provides  a  procedure  for  constructing 
a  linear  and  optimal  protocol.  Furthermore,  the  proof  shows  that  we  do  not  need  to  evaluate 
max*£»"  rankff(z)  but  only  the  rank  of  H( 0).  If  the  latter  rank  is  equal  to  n,  the  problem 
is  trivial,  and  if  it  is  less  than  n,  Lemma  3.1  applies  and  the  problem  can  be  reduced  to 
one  with  a  smaller  dimension.  Another  point  worth  mentioning  is  that  our  proof  actually 
suggests  a  deterministic  procedure  for  constructing  the  optimal  linear  protocol.  In  fact, 
one  can  first  compute  the  rank  of  H( 0).  If  rankf/(0)  =  n,  then  m*(x)  =  x*  is  an  optimal 
protocol.  If  /f(0)  has  rank  less  than  n,  then  one  can  use,  for  example,  Gaussian  elimination 
method  to  find  a  nonzero  vector  c  such  that  cT H{ 0)  =  0.  As  shown  in  the  proof,  the 
problem  is  reduced  to  one  with  a  smaller  dimension  by  a  suitable  change  of  variables.  By 
repeating  this  process  at  most  finitely  many  times,  one  will  find  an  optimal  linear  protocol 
for  computing  functions  /,  ,  i  =  1, . . . ,  *. 

4  Preliminaries  Continued. 

In  this  section,  we  review  some  results  (e.g.  Hilbert’s  Nullstcllensatz)  from  algebraic  geom¬ 
etry  (see  e.g.  (AM  69, H  77] )  that  will  be  needed  in  Section  5. 
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Let  C  jzj ,  xj, . . . ,  xn]  denote  the  ring  of  polynomials  of  variables  xi,...,xn  over  C,  the 
field  of  complex  numbers.  Let  f,g  E  C[xi,x2, . . . ,  We  use  the  notation  f\g  and  say 
that  /  divides  g  if  there  exists  some  h  E  C[xi,i2 , . . .  ,xn]  such  that  g  =  /  •  h.  W'e  say  that 
a  polynomial  g  E  C\x i,  x2»  ■  •  • »  in]  is  irreducible  if  =  /  •  h  implies  that  either  /  or  h  is  an 
element  of  C.  As  is  well  known,  C\x\,x 2, . . .  ,  xn]  is  a  unique  factorization  ring,  that  is,  each 
one  of  its  elements  can  be  expressed  as  a  product  of  irreducible  polynomials.  Furthermore, 
this  factorization  is  unique  up  to  reordering  of  the  factors  and  up  to  multiplication  of  each 
factor  by  an  element  of  C. 

Let  be  some  polynomials  in  C[xi,x2, . . .  ,i„].  We  define  the  zero  set  of 

/1  >  •  •  • » /r  by 

V(/l,.  ..Jr)  =  {(xi,X2  ...,X„)  €  Cn\  fk[xi,X2...,Xn)  =  0,  1  <  k  <  r}. 

We  now  state  a  simple  version  of  Hilbert’s  Nullstellensatz  [AM  69,  page  85]  that  will  be 
used  in  Section  6. 

Theorem  4.1  (Hilbert’s  Nullstellensatz)  Let  /i,..,/r  be  some  polynomials  in  C[xi, ...,  xn 
If  g  E  C\x  i, . . .  ,xn]  andV(fi, . . . ,  fr)  C  V(g),  then  there  exist  some  polynomials  g\, . . .  ,gr  € 

C[x i, . . .  ,xn]  and  some  positive  integer  k  such  that 

9*  =  ffl/l  +  <?2/2  + - l-Pr/r-  (4-1) 

Notice  that  if  Eq.  (4.1)  holds,  then  gk  E  S  and  V(/j, . . . , /r)  c  V [gk)  =  V(c?).  The  fact 
that  the  converse  is  also  true  is  exactly  the  content  of  Hilbert’s  theorem. 

Corollary  4.1  If  f,g  E  C[*i, . . .  ,x„],  and  if  f(x)  =  0  implies  that  p(x)  =  0,  x.e.,  if 
V(f)  c  V(g ),  then  there  is  an  integer  k  and  some  h  E  C[x  1, . . . ,  x„]  such  that  gk  =  Jh.  (In 
other  words,  f\gk-) 

One  can  assign  a  topology  to  the  field  Cn  by  taking  the  family  (V'(S)  |  S  is  an  ideal}  as 
the  closed  sets.  (It  is  a  simple  exercise  to  check  that  these  sets  satisfy  the  usual  requirements 
for  the  closed  sets  of  a  topology.)  Traditionally,  this  topology  is  called  the  Zariski  topology 
on  C.  An  important  property  of  Zariski  topology  is  the  following  (see  [H  77]). 

Theorem  4.2  Every  two  nonempty  Zariski  open  sets  of  Cn  have  nonempty  intersection  and 
every  closed  set  has  zero  Lebesgue  measure. 
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5  Two-Way  Communication  Complexity. 


In  this  section  we  study  the  two-way  communication  complexity  of  evaluating  a  function 
/  :  Df  *-*  C,  where  Df,  the  domain  of  /  is  an  open  subset  of  Cm  x  Cn.  Throughout,  we 
assume  that  /  is  twice  continuously  differentiable  on  Df. 

5.1  Abelson’s  Lower  Bound. 

Definition  5.1  We  let  Hzy(f)  be  the  matrix  (of  size  m  x  n)  whose  ( i,j)-th  entry  is  given 
*2  / 

by  jz  gy  ■  We  use  the  alternative  notations  (Hzy(f))(p)  and  Hzy(f)\p  to  denote  the  value 
of  Hzy(f)  at  some  vector  p  €  Df.  Also,  we  let  Vzf  and  V  yf  stand  for  the  vectors  of  dimen¬ 
sions  m  and  n  (respectively)  with  the  partial  derivatives  of  f  with  respect  to  the  components 
of  x  and  y,  respectively. 

The  following  basic  result  has  been  established  by  Abelson  [A  80]  :4 

Theorem  5.1  For  any  open  set  D  C  Dj  and  any  p  6  D,  we  have 

C2(f-D)>  xanY(Hzv(f))(p).  (5.1) 

Theorem  5.1  has  an  obvious  corollary 

Corollary  5.1  For  any  open  set  D  C  Df,  we  have 

C2(f  -,D)  >  max  rank (Hzy(f))(p).  (5.2) 

The  matrix  Hzy(f)  is  defined  in  terms  of  the  cross  derivatives  of  /  and  in  some  sense 
provides  information  on  how  x  and  y  are  interrelated  in  the  formula  for  /( z,y).  On  the 
other  hand,  Eq.  (5.1)  only  takes  into  account  the  second  order  derivatives  of  /  and  ignores 
the  higher  order  derivatives  or  the  first  order  derivatives  of  /.  Thus,  this  bound  should 
not  be  expected  to  be  tight,  in  general.  As  an  example,  let  /  be  a  linear  function,  e.g, 
/(*,  y)  =  aT x  +  bTy  (a  €  Cm,b  6  Cn,a  ^  0, 6  ^  0).  It  is  clear  that  Cz(f  ,  D)  =  1,  for  any 
open  set  D,  while  Eq.  (5.1)  gives  a  vacuous  lower  bound  of  zero.  The  following  corollary 
strengthens  Eq.  (5.1)  somewhat,  by  incorporating  the  first  order  derivatives  of  /  as  well  It 
is  only  a  minor  improvement  because  it  can  increase  the  lower  bound  by  at  most  1. 

4This  reiult  wm  actually  proved  in  [A  80]  for  real-valued  functions  defined  on  but  the  proof 

remains  valid  when  Si  is  replaced  by  C. 
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Corollary  5.2  For  any  open  set  D  C  Dj  vie  have 

Ci{f  \D)  >  maxmaxrank[(^Iy(/))(p)  +  cVI/(p)- Vv/(p)r). 
cec  peD 

Proof:  We  notice  that  C2(/;  D)  >  C2{g  o  /;  D),  for  any  twice  continuously  differentiable 
function  g  :  C  •-*  C,  where  g  o  /  denotes  the  composition  of  /  and  g.  For  any  p  e  D,  and 
c  €  C,  consider  a  function  g  such  that  g'{J{p ))  /  0  and  c  =  g"{f{p))/g'{f[p))-  The  result 
then  follows  by  applying  Theorem  5.1  to  the  function  g  o  / .  Q.E.D. 

In  the  remainder  of  this  section,  as  well  as  in  the  next  section,  we  investigate  the  extent 
to  which  Abelson’s  bound  is  tight  and  we  derive  some  tighter  bounds.  We  will  mostly 
restrict  attention  to  the  case  where  /  is  a  rational  function  and  we  will  require  the  messages 
to  be  rational  functions  of  the  input.  In  the  next  subsection,  we  identify  two  instances 
where  Abelson’s  lower  bound  (Theorem  5.1)  is  tight.  Then,  in  Subsection  5.3,  we  establish 
some  new  general  lower  bounds  by  making  use  of  Hilbert’s  Nullstellensatz. 

5.2  Some  Cases  Where  Abelson’s  Bound  is  Tight. 

We  consider  here  two  particular  cases  in  which  Abelson’s  bound  (Theorem  5.1)  can  be 
shown  to  be  tight.  This  is  in  contrast  to  the  results  in  Section  6  in  which  it  will  be  shown 
to  be  far  from  tight. 

Theorem  5.2  Suppose  that  /(z,  y)  =  xTQ y,  where  Q  is  a  matrix  of  size  m  x  n  and  x  € 
5?m,  y  G  5Rn.  Then  C2{f\  !Rr,+m)  =  rank Hxv{f)  =  rank(Q).  In  fact,  the  lower  bound  can  be 
attained  by  a  one-way  protocol  with  linear  messages. 

Proof:  Let  rank(Q)  =  r.  By  Theorem  5.1,  we  see  that  C2(/;5Jn+m)  >  rank Hzy[f)  = 
rank(Q)  =  r.  To  prove  the  other  direction  of  the  inequality,  we  will  present  a  one-way 
linear  protocol  that  uses  exactly  r  messages.  Using  the  singular  value  decomposition  of  Q, 
there  exist  vectors  uj, . . . ,  ur  €  !Rm  and  t>i, . . . ,  vr  E  5?n  such  that 

T'  7"  7* 

Q  =  t<iv{  +  u2u2  -i - 1-  urvr  , 

from  which  we  obtain 


xTQy  =  xTu1vf  y  +  xTu2v[ y+ - 1-  xTurvJ y. 


(5.3) 


Notice  that  in  Eq.  (5.3)  each  one  of  the  expressions  xTu,  and  vj y  is  a  scalar.  Thus,  the 
one-way  protocol  with  r  linear  messages,  defined  by  m,(x)  =  zru, ,  i  —  1, . . . ,  r,  is  adequate 
for  computing  /.  Q.E.D. 
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Theorem  5.2  states  that  Abelson’s  bound  is  tight  for  homogeneous  quadratic  polynomi¬ 
als.  What  happens  for  polynomials  of  degree  greater  than  2?  In  what  follow's,  we  will  show 
the  tightness  of  Abelson’s  bound  for  computing  functions  of  the  form:  /(x,y)  =  g{x  +  y), 
where  g  is  a  nonlinear  homogeneous  polynomial  in  no  more  than  4  variables.  W’hile  this 
result  determines  a  case  for  which  C2(/;  !R2n)  can  be  determined  completely,  it  is  of  little  use 
in  practice.  This  is  because  we  have  n  <  4  and  the  naive  protocol  rrii(x)  =  x,,  i  =  1, . . . ,  n, 
uses  at  most  4  messages  and  cannot  be  too  far  from  being  optimal.  Our  result  makes  use 
of  the  following  theorem  proved  by  Gordan  and  Noether  in  1876  ([GN  76]). 

Theorem  5.3  Let  f  :  SRn  ►-»  9?  be  a  nonlinear  homogeneous  polynomial  in  n  <  4  variables 
and  let  H(f)  be  its  Hessian  matrix.  If  detH(f)  =  0,  then  there  exists  a  linear  mapping  T 
from  SRn  onto  5Rn-1  and  a  homogeneous  polynomial  g  :  9?n-1  h-»  SR  such  tha l  /(x)  =  g{Tx). 

Our  result  is  the  following: 

Theorem  5.4  Let  g  :  SRn  •—»  SR  be  a  nonlinear  homogeneous  polynomial  and  let  the  polyno¬ 
mial  f  :  5R2n  •— ♦  9?"  be  defined  by  f(x,  y)  =  g(x  -f  y).  If  n  <  4,  then 

C2(/;9?2n)=  max  rank//Iy(/)|{l  j  =  CMnf«r(/;  SR2")- 

Proof:  Let  z  =  x  +  y.  We  regard  g  as  a  nonlinear  polynomial  in  the  variable  z  6  S". 
Let  k  be  the  smallest  integer  such  that  there  exists  some  linear  mapping  T  from  9?n  onto 
SR*  and  some  homogeneous  polynomial  g  :  SR*  *-►  SR  such  that  g{z)  =  g{Tz).  Since  g  is 
nonlinear  and  T  is  linear,  we  see  that  g  is  also  nonlinear.  We  claim  that  there  exists 
some  vector  z  =  (zi,...,z*)  €  SR*  at  which  H[g)  is  nonsingular.  Indeed,  if  this  is  not  so, 
then  by  Theorem  5.3,  there  exists  another  linear  mapping  T  from  9?*  onto  SR*-1  and  some 
homogeneous  polynomial  g  :  9J*_1  *-*  9i  such  that  g{z)  =  g(Tz).  But  this  implies  that 
g(z)  =  $(TTz).  Since  the  composition  of  T  snd  T  maps  SRn  onto  SR*-1,  this  contradicts  the 
definition  of  k. 

A  simple  calculation  shows  that  H[g)\t  =  Tt H{g)\rtT .  Since  T  maps  5Rn  onto  9?*,  the 
matrices  T  and  Tt  have  full  rank  and  we  obtain  rankH(y)|,  =  rank//(y)|j,.  Since  the 
range  of  T  is  all  of  9?*,  we  have 

max  rank/f(o)  .  =  max  ranki/(6)!i  =  k. 

*e»"  ie*k  ^ 

Since  #iv(/)|(i,v)  =  H{g)\i=z+y,  we  obtain  that  max^j,  rank//IV(/)  =  Jt.  It  then  follows 
from  Theorem  5.1  that  C2(/;5R2n)  >  k.  To  establish  the  reverse  inequality,  we  will  present 
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a  protocol  for  computing  /  that  uses  exactly  k  messages.  Let  m,(x)  =  7\x,  i  = 
where  T,  is  the  *th  row  of  the  matrix  T.  Then, 

/(*»  V)  =  9{x  +  y)=g{T{x  +  y))  =  g{Ti{x  +  y),...,Tk{x  +  y)) 

=  Hmi(I)  +  7’iy,...,m*(x)  +  r*y). 

This  last  formula  shows  that  /  can  be  computed  using  the  one-way  protocol  with  messages 
m,(x)  =  T,x,  i  =  1, . . . ,  k.  In  particular,  Ci{f  \  SR2")  <  Qlfvear(/;  SR2")  <  k,  which  completes 
the  proof.  Q.E.D. 

Unfortunately,  Theorem  5.4  is  not  true  for  the  case  n  >  4,  for  the  simple  reason  that 
Theorem  5.3  fails  to  hold.  Historically,  Hess  had  published  a  paper  in  which  he  gave  an 
erroneous  proof  of  Theorem  5.3  for  all  n.  It  was  later  discovered  by  Gordan  and  Noether 
that  Hess’  proof  was  incorrect  and  proved  that  the  largest  value  of  n  for  which  Theorem 
5.3  holds  is  4. 

5.3  Some  New  Lower  Bounds. 

Throughout  this  subsection  we  assume  that  {  \  Dj  *-*  C  is  a  complex  rational  function, 
where  Df  C  Cm  X  Cn  is  the  set  of  vectors  ( x,y )  at  which  /  is  finite.  In  this  context  it 
is  natural  to  consider  “rational”  protocols,  in  which  the  messages  transmitted  are  rational 
functions  of  the  input  data  (x,y). 

We  present  two  new  methods  for  establishing  lower  bounds  on  the  two-way  communi¬ 
cation  complexity  in  this  setting.  The  first  method  provides  lower  bounds  on  Crat(f;D) 
for  any  open  subset  D  €  Df.  The  second  method  requires  that  D  =  Dj  but  usually  gives 
sharper  lower  bounds. 

Our  first  method  (Theorem  5. 5-5. 7)  exploits  the  fact  that  any  rational  protocol  can 
be  converted  into  a  protocol  in  which  the  messages  are  polynomial  functions  of  (x,y)  and 
which  uses  at  most  .wice  as  many  messages: 

Theorem  5.5  Let  f  be  a  rational  function  and  let  D  be  an  open  subset  of  Df.  Then  there 
holds 

Crat(/i-^/)  ^  Cp0ly(f’,Df)  <  2Crot(/;  Df). 

The  idea  behind  the  proof  of  Theorem  5.5  is  that  each  rational  message  of  a  rational 
protocol  can  be  replaced  by  two  polynomial  messages  consisting  of  the  numerator  and 
denominator  polynomials  (respectively)  of  the  original  message.  The  proof  can  be  found  in 
(L  89]  and  is  omitted  because  it  is  relatively  straightforward  and  also  because  Theorem  5.5 
will  not  be  invoked  in  subsequent  proofs. 
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Suppose  that  /(x,y)  =  p(x,y)/q(x,y)  where  p  and  q  are  two  relatively  prime  poly¬ 
nomials.  In  particular,  Dj  =  {(x,y)  |  q[x,y)  yt  0}.  Let  D  C  Dj  be  an  open  subset. 
Consider  some  rational  protocol  it  £  Urat(f  ',  D)  with  r  messages,  where  r  =  Cra<(/;  D)  (cf. 
Section  1).  Then,  by  Theorem  5.5,  there  exists  a  polynomial  protocol  it'  £  Tlpoiy[f ;  D) 
that  uses  2r  messages.  Let  be  the  message  functions  of  the  protocol  it'.  As¬ 

suming  that  processor  Pi  performs  the  final  evaluation  of  f(x,y),  we  must  have  /(x,y )  = 
h(x,mi(x,y), . . .  ,m2r(x,y))  for  all  (x,y)  £  D,  where  h  is  a  rational  function.  Since  h  is 
rational,  we  must  have  /(x, y)  =  p'{xty)/q'{x,y),  where  p'  and  q'  are  some  polynomials 
whose  values  (on  the  set  D)  are  completely  determined  by  the  values  of  the  message  func¬ 
tions  mi, . . .  ,m2r  and  x.  This  implies  that  Cpoiy{p'\  D)  <  2r  and  Cpoiy(q'\  D )  <  2r.  Notice 
that  p/q  =  p'/q'-  Using  the  unique  factorization  property  of  rational  functions  over  C  (cf. 
Section  4),  we  see  that  p'  =  pg  and  q'  =  qg  for  some  nonzero  polynomial  g.  We  conclude 
that  there  exists  some  nonzero  polynomial  g  such  that 

Cratif;  D)  ^  2^Poiv(P^>  D) 

and 

Cratif  \  D)  ^  2^'po,v(95i -^)- 

This  shows  that  we  can  bound  from  below  the  communication  complexity  of  /  by  bounding 
from  below  the  communication  complexity  of  pg  or  qg.  The  difficulty,  however,  is  that  the 
polynomial  g  is  not  known  and  we  are  forced  to  develop  a  bound  which  is  valid  for  an 
arbitrary  choice  of  g.  Ideally,  we  would  like  to  able  to  say  that  if  p  has  high  communication 
complexity  then  the  same  is  true  for  pg.  Although  this  does  not  seem  to  be  true  in  general, 
the  following  result  makes  a  step  in  that  direction. 

Theorem  5.6  Let  f,g  £  Cjxi, . . . ,  xm,yi, . . .  ,y„]  be  two  nonzero  polynomials  which  are 
relatively  prime.  Then, 

C,(/j;Cm+" )>(,m.X(/)  „nkff„(/)|M  -2, 
where  V(f)  =  {(x,y)  |  /(x,  y)  =  0}  is  the  zero  set  of  the  polynomial  f. 

Proof:  By  Theorem  5.1,  we  have 


C2(fg;Cm+n)  >  ^  rneaxm+nrank(^(/y))|{iv) 

=  +" rank  {^x’y^H^(9)\(z,v)  +  9{x,y)HIV{f)\{X:V) 


+ 
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+  (V*/(x,y))(Vvy(z,y))T  +  (VlS(x,y))(Vv/(x,y))T) 

^  (..,KC”-r“K/(l's)^'(5)l(>.»)  +  5(X'!')"',(/)K-.v))  “2 

-  (».!?“( /)r“k  • 2- 
Choose  some  (xo.yo)  €  V(/)  such  that 

rank  (  Hxy(f)\  J=  max  i&nV.Hzy[f)\  =  r. 

Then,  there  exists  a  submatrix  M  of  size  r  x  r  embedded  in  Hzy[f),  which  is  nonsingular 
at  (xo,  yo).  We  view  this  submatrix  as  a  function  of  (x,y)  and  we  consider  its  determinant 
det(JVT)  which  is  a  polynomial  in  (x,y).  We  have  just  shown  that  V(f)  is  not  contained  in 
V(det(M)).  In  other  words,  if  we  write  /  as  a  product  of  irreducible  polynomials,  then  at 
least  one  of  the  irreducible  factors  of  /,  call  it  /i,  does  not  divide  det(M).  But  since  /  and  g 
are  relatively  prime,  it  follows  that  f\  does  not  divide  g  either.  We  conclude  that  f\  does  not 
divide  g  -det(M).  We  now  claim  that  V(/)  £  V(g  -det(Af)).  If  the  claim  is  not  true,  then 
V(f)  C  V[g  ■  det(A/)).  Hilbert’s  Nullstellensatz  applies  and  shows  that  (g  ■  det(M))*  =  fh 
for  some  positive  integer  k  and  some  polynomial  h.  By  the  unique  factorization  property, 
we  see  that  the  irreducible  polynomial  f\  would  have  to  be  a  factor  of  either  g  or  det(A/), 
which  is  a  contradiction  and  establishes  our  claim. 

Since  V (/)  <£  V [g  •  det(Af)),  there  exists  some  (x‘,  y*)  G  V (/)  such  that 
y(x", y*)det(M)|(I.  ^0.  Consequently, 

,  max  rank  (y(x,y)/fxv(/)|(  ))  >  rank  (g{x\  y')Hxv[f)\{t.  „.)) 

(i,|/)6  V[f)  '  " 

=  r 

=  max  rank  ( Hzv(f)  )  . 

(*.v)6V(/)  V  v  (*.v)/ 

which  completes  the  proof  of  the  theorem.  Q.E.D. 

The  above  theorem  states  that  if  rank Htv[f)  is  large  for  some  (x,y)  G  ^(/))  then  fg 
also  has  large  communication  complexity  for  any  polynomial  g  which  is  relatively  prime  to 
/.  Unfortunately,  Theorem  5.6  is  not  always  sufficient  for  proving  tight  lower  bounds  for 
fg  because  there  exist  functions  /  for  which  rank Hxv[f)  is  small  for  every  (x,y)  G  V(f) 
even  though  Htv(f)  has  high  rank  when  the  restriction  (x, y)  G  V(f)  is  removed.  A  specific 
example  will  be  seen  in  the  next  section. 

The  following  is  a  result  from  algebraic  geometry  which  gives  a  sufficient  condition  on 
/  under  which  Hzy{f)  has  high  rank  at  some  point  belonging  to  V(f). 
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Theorem  5.7  Let  J  be  a  nonlinear  homogeneous  polynomial  in  n  variables  such  that  V/(x)  ^ 
0  for  every  x  E  V(f).  Let  the  polynomial  f  :  C2n  >— ►  C  be  defined  by  f(x,y )  =  f(x  +  »)■ 
Then, 

max  rank  (  #*„(/)  I  >  n  -  1. 

The  proof  of  the  above  theorem  can  be  found  in  [Z  83]  and  [KL  84] .  As  an  immediate 
consequence  of  above  Theorems  5.6  and  5.7  we  have  the  following: 

Corollary  5.3  Let  f  and  f  be  as  in  Theorem  5.7  Then, 

C2(/-<?;C2n)>n-l 

for  any  polynomial  g  which  is  relatively  prime  to  f . 

Unfortunately,  the  above  corollary  is  not  easy  to  apply,  because  the  set  V  (/)  is  usually 
hard  to  determine.  Accordingly,  the  condition  Vf(x)^0on  the  set  V[  f)  cannot  be  easily 
tested.  In  fact,  it  seems  a  lot  easier  to  just  compute  the  rank  of  Hzv[f)  at  a  random  point 
of  V{f)  because  maxj, ,v)ev(f)  ranktf*v(/)|(XiV}  is  attained  at  the  majority  of  points  onV(/) 
(a  Zariski  open  set  of  V(/)). 

We  have  so  far  shown  that  lower  bounds  on  the  communication  complexity  of  a  rational 
function  /  =  p/q  (p  and  q  are  relatively  prime)  can  be  obtained  by  developing  lower  bounds 
on  the  communication  complexity  of  pg  or  qg,  where  g  is  an  arbitrary  nonzero  polynomial. 
We  now  develop  our  second  method  for  establishing  lower  bounds  by  exploiting  the  fact  that 
if  a  protocol  is  to  have  domain  the  set  Df  on  which  /  is  finite,  then  the  polynomial  g  is  not 
entirely  arbitrary.  We  have  shown  earlier  that  if  /  can  be  evaluated  by  a  rational  protocol 
with  domain  Df,  then  there  exist  polynomials  p'  and  q'  such  that  /(x,y)  =  p'(x,y)/q'(x,  y) 
for  all  (x,y)  £  Df  and  Cpoiv(f  \Df)  >  \Cpoiv(q';  Df).  The  polynomial  q'  must  certainly 
satisfy  q'  =  qg,  for  some  g,  but  it  must  also  be  nonzero  at  every  point  in  the  domain  D/  of 
/  because  otherwise  the  expression  p'{x,y)/q'[x,  y)  will  be  meaningless  for  some  (x,y)  €  Df. 
This  additional  constraint  is  used  in  an  essential  way  in  the  following  result. 

Theorem  5.8  Suppose  that  f  is  a  rational  function  and  that  f  =  p/q,  where  p,q  E 
C[x  i, . . .  ,zm,yi,. . .  ,y„]  are  relatively  prime  polynomials.  If  q  is  irreducible,  then 

a) 

Crat{f  ]  Df)  >  v™£mxCn  rank^*v(?) I(z.y)  -  1-  (5-4) 
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b) 


Crat(f,Df)  >  ^  (zty)2c*xCn  ranktf*y(p)|(Xitf)  - 


(5.5) 


Pr^of:  a)  Consider  a  rational  protocol  for  computing  /  on  Df  that  uses  r  =  Crat{f ;  Df) 
messages  and  let  :  D/  >-*  C  be  the  corresponding  message  functions.  We  first 

consider  the  special  case  where  each  one  of  the  message  functions  is  a  polynomial.  Without 
loss  of  generality,  we  assume  that  the  final  evaluation  of  the  function  /  is  performed  by 
processor  P\.  By  the  definition  of  a  rational  protocol  (cf.  Section  1),  there  exists  a  rational 
function  h  such  that  /(x,  y)  =  h[x,  m\(x,  y), . . . ,  mT(  x,  y))  for  all  (x,  y)  €  Df.  Note  that  h 
can  be  expressed  in  the  form 


/i(x,m1(x,y),...,mr(x,y)) 


hi(x,mi(x,y),.. ,  ,mr(x,y)) 
h2(x,m1(x,y),...,mr(x,y))’ 


where  hi  and  /i2  are  relatively  prime  polynomials.  Let  h'2(x,  y)  =  h2(x,  m1(x,  y), . . . ,  mr(x,  y)) 
The  functions  were  originally  defined  on  Df.  On  the  other  hand ,  since  they  are 

polynomials  they  can  be  uniquely  extended  to  polynomial  functions  on  the  entire  of  Cm+n. 
Furthermore,  the  representation  h'2(x,y)  =  h2(x, mi(x,  y), . . . , mr(x, y))  must  be  also  valid 
over  Cm+n  and  this  implies  that  Cpoiv(h2',  Cm+n)  <  r •  We  now  notice  that  we  must  have 
h'2{x,y)  ^  0  for  all  (x,y)  £  Df,  because  the  function  h  must  be  defined  for  all  (x,y)  £  Df. 
Equivalently,  V {h'2)  c  V(q),  where  q  is  the  denominator  polynomial  of  /,  assumed  irre¬ 
ducible.  Hilbert’s  Nullstellensatz  shows  that  qk  =  h'2g,  for  some  polynomial  g  and  some 
positive  integer  k.  We  factor  the  polynomial  h'2g  as  a  product  of  irreducible  factors.  Since 
q  is  irreducible,  it  follows  that  each  one  of  these  factors  must  be  equal  to  q.  We  conclude 
that  h2  =  cqK  for  some  nonzero  constant  c  £  C  and  some  positive  integer  K,  and  therefore 

r  >  Cpolv{h'2,Cm+n)  =  Cpolv(qK;Cm+n). 

We  now  consider  the  case  where  there  exists  some  t  such  that  the  message  function 
m,  is  not  a  polynomial  and  let  us  choose,  in  particular,  the  first  such  index  «.  Suppose, 
without  loss  of  generality,  that  t  €  Ti_2.  We  have  mj(x,y)  =  m,(x,mi(i,y),..  .,m,_1(x)) 
for  some  rational  function  m,  and  each  one  of  the  functions  mi, . . .  ,m,_!  is  a  polynomial. 
We  write  m,  in  the  form  m,(x,y)  =  hi(x,  y)/h2(x,  y),  where  h\  and  h2  are  relatively  prime 
polynomials.  We  now  repeat  the  argument  of  the  preceding  paragraph.  Since  the  domain 
of  the  protocol  is  all  of  Df,  it  follows  that  V(/i2)  C  V(q)  and  h2  =  cqK  for  some  nonzero 
constant  c  £  C  and  some  positive  integer  K.  Furthermore,  it  is  clear  that  h2  can  be 
expressed  as  a  polynomial  function  of  mi(x,y), . . .  ,m,_i(x,  y)  which  implies  that  r  >  i  - 1  > 

CPclM-,Cm+n)  =  Cpolv(qK;Cm+n). 

To  summarize,  we  have  shown  that  in  both  cases  that  there  exists  a  positive  integer  K 
for  which  Crat{f',Df)  —  r  >  Cpoiy(qK ;Cm+n).  It  now  remains  to  derive  a  lower  bound  on 
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CPoiv(qK \Cm+n).  To  this  effect,  we  apply  Theorem  5.1.  We  have 
CPoiy(qK-,Cm+n)  >  C2(qK-Cm+n) 

*  (lXcX-rank^(?ff)l(1^ 

+K[K  -  l)qK~i(x,  y)(VIg(i,y))(V„g(i,  y))Tj  (5.6) 

-  (*,lOCC"*+n  rank  S/) -^*I/C9) l(x,y))  —  1  (5-7) 

-  (,>tfrc-"rank^(9)i(^-1’  (58) 

Here  the  first  equality  (5.6)  is  a  simple  calculation  and  the  next  step  (5.7)  is  due  to  the 
fact  (Vxg)(Vyg)r  has  rank  at  most  1.  The  last  step  is  obtained  as  follows.  The  set 
{(x,y)  |  q(x,y )  ^  0}  is  a  Zariski  open  set.  Furthermore,  the  maximum  rank  of  Hzy{q) 
is  attained  at  the  set  of  points  where  the  determinant  of  a  suitable  submatrix  of  Hzy[q ) 
does  not  vanish  and  is  also  a  Zariski  open  set.  Since  every  two  nonempty  Zariski  open 
sets  have  nonempty  intersection  (Theorem  4.2),  it  suffices  to  consider  a  vector  (x,y)  in  the 
intersection  of  these  two  sets. 

b)  Let  (x,y)  be  an  arbitrary  vector  of  Dj.  Note  that 

*.,(£)  =  -Hzy{p)  -  %Hzy(q)  -  1  (VlP)r  (Vy?)  +  %  (Vtq)T  (Vy9) . 

q  q  q  q*  q> 

By  evaluating  the  rank  of  both  sides  at  (x,y)  and  noticing  that  both  (Vxp)  (Vyq)  ](2 yj 
and  (V*g)T  (Vvq)  |(Iy)  have  rank  at  most  1,  we  see  that 

ranktfiy(^)|(ly)  >  rank//iy(p)j(l  y)  -  ranktfiy(?)|(iy)  -  2. 

Therefore, 

Crat(f-,Df)  >  C2(/;D/) 

>  rankffIy(^)|(iy) 

>  rank/fly(p)|(iy)  -  ranktfiy(9)|(ly)  -  2 

>  rank/fiy(p)|(ly)  -  CTat{f\  Df)  -  3,  V(x,  y)  €  Df, 

where  the  last  step  follows  from  Eq.  (5.4).  After  rearranging  the  above  inequality,  we  see 
that 

Crat{f  \  Df)  >  ^ranktfIy(p)i(l  y)  -  V(x,  y)  €  Df.  (5.9) 
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Since  max(Ii(l)e^+n  rank Hzy[p)  is  attained  at  a  Zariski  open  set  and  Dj  is  also  a  Zariski 
open  set,  by  Theorem  4.2,  there  exists  some  vector  (x*,y*)  G  D/  such  that 

^max^ranktf^p)  =  rank  H«y(p)|(,.iV.j. 

Now  Eq.  (5.5)  becomes  evident  when  one  considers  (5.9)  at  (z*,y*).  Q.E.D. 

6  An  fi(n2)  Lower  Bound  for  Computing  [(x  +  y)_1]n 

Let  x  and  y  be  n  x  n  matrices.  As  an  application  of  the  results  of  Section  5,  we  consider 
the  communication  complexity  of  the  function  /(x,  y)  =  [(x  +  y)_1]n  (the  (l,l)th  entry  of 
(x  +  y)-1)  within  the  class  of  rational  protocols.  While  Abelson’s  lower  bound  is  only  f2 (n), 
we  derive  a  lower  bound  of  n2  —  1,  which  is  almost  equal  to  the  obvious  upper  bound  of  n2. 
In  particular,  this  example  will  show  that  Abelson’s  bound  can  be  far  from  tight. 

We  motivate  our  choice  of  the  problem.  The  value  of  [(x  +  y)_1]n  can  be  thought  of 
as  the  solution  of  the  system  of  linear  equations:  (x  +  y)u  =  6,  where  b  —  (1,0, ...  ,0)  and 
u  is  the  unknown.  Thus  the  problem  under  consideration  captures  the  essential  difficulties 
of  a  distributed  solution  of  a  system  of  the  form  (x  -f  y)u  =  b,  when  x  and  y  are  possessed 
by  different  processors.  Since  the  solution  of  linear  systems  of  equations  is  the  most  basic 
problem  in  numerical  computation,  the  problem  we  are  studying  is  an  interesting  paradigm. 

It  is  easy  to  see  that  n2  messages  would  be  needed  if  we  had  required  that  a  particular 
processor,  say  Pi,  should  eventually  evaluate  all  entries  of  the  inverse  matrix  (x  +  y)_1. 
(This  is  because  Pi  could  then  invert  (z  +  y)-1  to  obtain  x-)-y  and  use  its  knowledge  of  x  to 
infer  the  value  of  y,  and  this  is  possible  only  if  at  least  n2  messages  have  been  exchanged.) 
However,  the  fact  that  the  evaluation  of  the  whole  inverse  matrix  (x  +  y)-1  is  hard  does 
not  imply  that  the  computation  of  a  particular  entry  is  also  difficult.  In  fact,  we  shall  see 
that  the  derivation  of  tight  bounds  on  the  communication  complexity  of  [(x  +  y)-1]n  is 
surprisingly  hard.  As  a  first  indication,  we  show  that  Abelson’s  result  (Theorem  5.1)  gives 
only  an  ft(n)  lower  bound. 

Theorem  6.1  Let  f(x, y)  =  [(x  +  y)_1]u.  Then 

max  ranktfI(,(/)|(Xi„)  <  3n.  (6.1) 

( z,y)£Df 

Proof:  Let  us  fix  a  pair  p  —  (xo,yo)  6  £>/  of  nx  n  matrices.  We  will  show  that  the  rank 
of  Hty(f)\p  is  at  most  3n.  Let  A1(  A2  be  two  n  x  n  perturbation  matrices.  We  consider  the 
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Taylor  series  expansion  of  /  at  the  point  p: 

/(io  +  Ai, yo  +  A2)  =  ((io  +  Vo)  “  (Ai  +  A2))  1]n 

=  (xo  +  yo)  J] n  +  [(^o  +  yo)  !(Ai  +  A2)(xo  +  yo)]n 
+[(xo  +  yo)-1  ((Ai  +  A2)(x0  +  yo))*]n  +  •  •  • 

Notice  that  the  value  of  Hxy{f) |p  is  completely  determined  by  the  second  order  terms  of 
this  expansion.  Thus,  if  we  let 

y(Ai,  A2)  =  [(xo  +  yo)  1  ((Ai  +  A2)(xo  +  yo))2]  11 , 

then  Hzy{f) |(*0,vo)  =  (ff)!(o,o)-  Therefore,  we  only  need  to  show  that 

rank/fAlAa(s)!(010)  ^  3n.  We  will  present  a  two-way  polynomial  protocol  for  computing  g 
that  only  uses  3 n  messages. 

Notice  that  as  far  as  the  computation  of  g  is  concerned,  the  matrices  io,yo  are  constant 
and  the  matrices  A,  (t  =  1,2)  are  the  inputs.  Let  e  =  (1,0, .. .  ,0)T.  The  protocol  proceeds 
as  follows. 


1.  Processor  Pi  sends  the  vector  Ai(xo  +  yo)e  to  processor  Pi  (n  messages). 

2.  Processor  Pi  computes  (Ai  +  A2XX0  +  yo)e  and  sends  the  following  two  vectors  (2n 
messages)  to  P\\ 

(Ai  +  A2)(x0  +  yo)e 


A2{xq  +  yo)(Ai  +  A2)(xq  +  yo)e- 


3.  Once  processor  Pi  receives  these  messages,  it  can  use  its  knowledge  of  Ai  to  evaluate 
((Ai  +  A2)(x0  +  yo))2  e.  It  follows  that  y(Ai,  A2)  =  [(x0+yo)_1  ((Ai  +  A2)(x0  +  yo))2]n 
can  also  be  evaluated  by  Pi. 


By  Abelson’s  result  (Theorem  5.1),  we  see  that  for  any  open  set  D  containing  (0,0),  we 
have 

ranktfA,A3(fl) |(o,o)  <  Cpo,v(g;D )  <  3n, 
which  completes  the  proof.  Q.E.D. 

Let  Df  be  the  set  of  all  (x,y)  6  C"2  x  C"2  at  which  the  rational  function  /(x,y)  = 
[(x  +  y)-1]  11  is  well-defined.  Clearly,  Df  is  the  same  as  the  set  of  all  (x,y)  such  that 
det(z  +  y)  ^  0.  Our  main  result  is  the  following. 
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Theorem  6.2 


Cfat(/;L>/)>n2-l. 


(6.2) 


The  proof  is  based  on  two  lemmas: 

Lemma  6.1  The  polynomial  g(x,y)  =  det(z  +  y)  is  irreducible. 

Lemma  6.2  Suppose  that  n  >  1  and  let  g[x, y)  =  det(z  +  y).  Then  the  rank  of  Hzv(g) 
evaluated  at  {1,0)  (7  is  the  identity  matrix)  is  n2. 

Once  these  two  lemmas  are  proved,  the  desired  result  is  obtained  as  follows.  If  n  =  1, 
then  Eq.  (6.2)  holds  trivially.  For  n  >  1,  we  have  f{x,  y)  =  detn  (z  +  y)/det(z  +  y)  = 
detn(z  +  y)/g{x,y),  where  detn(z  +  y)  is  the  cofactor  of  the  (l,l)th  entry  of  z  +  y.  It  is 
seen  that  g{x  +  y)  does  not  divide  detn(z  +  y),  because  otherwise  [(z  +  y)-1]n  would  be 
a  polynomial  in  the  entries  of  z  and  y,  which  is  easily  shown  not  to  be  the  case.  Since  g 
is  irreducible  (Lemma  6.1),  we  conclude  that  the  polynomials  detn(z  +  y)  and  g{x,y)  are 
relatively  prime.  Then,  Theorem  5.8  applies  and  shows  that 

Crat{f  ]  Df)  >  max  Hxv{g) |(liV)  -  1  >  n2  -  1, 

(i,v)€CJn3 

where  the  last  inequality  has  made  use  of  Lemma  6.2.  Thus,  it  only  remains  to  prove  the 
two  lemmas. 

Proof  of  Lemma  6.1:  If  n  =  1,  then  g(x,y)  =  x  +  y,  which  is  obviously  an  irreducible 
polynomial.  For  n  >  1,  we  assume,  in  order  to  derive  a  contradiction,  that  /(z, y)  = 
A(x,y)B(x,y)  where  A,  B  are  nonconstant  polynomial  functions  of  the  entries  of  z,y.  Let 
Zjj  (respectively,  y,y)  denote  the  (*,j)th  entry  of  z  (respectively,  y).  Let  us  restrict  z  and 
y  by  letting  zi,  =  -yi,  =  1 , *  =  2,...,n.  With  such  a  restriction,  /,  A,  and  B  can  be 
expressed  as  polynomials  /,  A,  and  B,  respectively,  of  the  unrestricted  variables.  Note  that 

/(z,y)  =  (zu  +  yn)detn(z  +  y)  =  A(z,y)S(z,y). 

By  the  unique  factorization  property  of  polynomials,  we  see  that  (in  +  yu)  must  be  a 
factor  of  either  A(x,  y)  or  B(z,y).  Since  det(z  +  y)  is  a  linear  function  of  zji  +  yn,  we 
conclude  that  zn.yn  appear  together  in  either  A  or  B,  but  not  in  both.  It  then  follows 
that  zn,yn  appear  together  in  either  A(z,y)  or  B(x,y),  but  not  in  both.  Repeating  our 
argument  for  all  [i,j)  (1  <  i,j  <  n),  we  see  that  either  z,y  and  ytJ  both  appear  only  in 
A(x,y)  or  they  both  appear  only  in  B(x,  y).  Therefore  the  set  {(»,.?'),  i,j  =  l,2,...,n} 
can  be  partitioned  into  two  subsets  7?i,Rj  (with  7?i  being  nonempty)  such  that  A{x,  y) 
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depends  only  on  the  entries  Xij , y^  with  (t ,j)  E  Ri  and  B{x,y)  depends  only  on  the  entries 
X\jt  Vij  with  (i,j)  E  R2  .  Let  us  express  each  one  of  the  polynomials  A  and  B  as  a  sum  of 
products  and  then  carry  out  the  cross-multiplications  to  expand  A(x,  y)B(x,  y)  as  a  sum 
of  products.  Since  A  and  B  depend  on  different  entries,  it  is  seen  that  this  expansion  leads 
to  no  cancellations.  Hence,  if  (i,j)  is  in  Ri  then  (t,fc)  and  ( k,j)t  k  =  1  ,...n,  also  belong 
Ri,  since  otherwise  there  would  be  a  term  in  the  expansion  of  A(x,y)B(x,y)  =  det(x  +  V) 
with  two  entries  from  the  same  row  or  column.  This  implies  that  all  of  the  entries  must  be 
in  Ri,  and  Rj  is  empty.  Consequently,  B[x,y)  is  a  constant  polynomial,  which  contradicts 
our  original  assumption.  Q.E.D. 

Proof  of  Lemma  6.2:  An  easy  calculation  yields 

1  if  »  =  j,  l  =  m  and  «  7^  l 
- 1  if  t  =  m,  j  =  l  and  i  ^  l 
0  otherwise. 

Thus,  if  the  rows  and  columns  of  Hzv{g)\(i,o)  are  suitably  rearranged,  the  matrix  #zV(ff)|(/,o) 
has  the  structure  shown  in  Fig.  1.  It  is  not  hard  to  see  that  this  matrix  is  nonsingular  and 
therefore  has  rank  n2. Q.E.D. 

We  would  like  to  be  able  to  strengthen  Theorem  6.2  in  a  number  of  directions.  First, 
Theorem  6.2  refers  to  the  computation  of  [(x  +  y)_1]n>  where  x,  y  are  complex  matrices. 
This  does  not  lead  to  a  lower  bound  when  we  restrict  x  and  y  to  be  real,  even  though  this  is 
the  case  of  main  practical  interest.  A  related  deficiency  is  that  the  lower  bound  apphes  only 
to  protocols  whose  domain  is  equal  to  all  of  Dj.  It  would  be  interesting  to  know  whether 
the  communication  complexity  of  the  problem  can  be  reduced  by  an  order  of  magnitude 
when  we  restrict  to  real  matrices,  or  if  we  only  consider  the  evaluation  of  /  in  an  open  set 
real  matrices.  We  conjecture  that  this  is  not  the  case,  but  we  are  not  aware  of  any  proof 
technique  that  could  lead  to  such  a  result. 

One  possible  approach  for  proving  a  stronger  lower  bound  is  based  on  Theorem  5.6  of 
Section  5.  This  result  shows  that  an  fl(n2)  lower  bound  will  be  established  if  we  manage 
to  find  a  pair  (x,y)  of  matrices  such  that  y(x,y)  =  0  and  rankf/tv(y)|(IiV)  =  fl(n2),  where 
y(x,y)  =  det(x  +  y).  Unfortunately,  the  determinant  function  is  particularly  nasty  in  that 
respect.  It  can  be  shown  [L  89]  that  the  rank  of  Hty{g)  is  nJ  at  each  point  (x,y)  such  that 
x  +  y  is  invertible  but  it  is  no  more  than  3n  +  3  at  each  point  (x,y)  at  which  y(x,  y)  =  0. 

Finally,  let  us  mention  that  an  f 2(n*)  bound  can  also  be  obtained  for  the  special  case 
where  x  and  y  are  restricted  to  be  symmetric  matrices.  The  proof  is  similar  to  the  proof  of 
Theorem  6.2. 


d*9 


dxijdyi. 


(f.0) 


=  < 
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7  Conclusions  and  Extensions. 


We  have  presented  a  variety  of  new  results  on  the  one-way  and  two-way  communication 
complexity  for  algebraic  problems.  We  have  used,  in  several  occasions,  the  results  of  [A  80], 
but  our  results  are  often  stronger  because  they  exploit  the  algebraic  structure  of  the  problem. 

There  are  several  directions  for  further  research  on  the  subject.  One  direction  concerns 
the  derivation  of  lower  bounds  on  two-way  communication  complexity  that  involve  infor¬ 
mation  other  than  the  second  order  derivatives.  (One  such  result  can  be  found  in  [TL  89] . ) 
Another  direction  concerns  two-way  protocols  for  computing  a  collection  {/i,  ...,/„}  of 
functions,  with  s  >  1.  Here,  even  if  one  assumes  that  the  functions  /,  are  quadratic,  the 
evaluation  of  the  communication  complexity  is  surprisingly  hard  and  leads  to  problems 
with  a  combinatorial  flavor.  (Some  partial  results  can  be  found  in  [L  89].)  A  final  direction 
concerns  “multi-party”  protocols  in  which  more  than  two  processors  are  involved.  There 
is  very  little  literature  on  this  subject  [CF  83]  and  it  is  not  completely  clear  what  are  the 
interesting  problems  in  this  area. 
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